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Digital Controller 

Many real-time systems are embedded in sensors and actuators and function as digital controllers. Figure, below shows 
such a system. The term plant in the block diagram refers to a controlled system. An engine, a brake, an aircraft, a patient 
are some examples. The state of the plant is monitored by sensors and can be changed by actuators. The real-time system 
estimates from the sensor readings the current state of the plant and computes a control output based on the difference 
between the current state and the desired state (called reference input shown in the figure). This computation is called the 
control-law computation of the controller. The output generated activates the actuators, which bring the plant closer to the 
desired state. 

A simple example is shown below: consider an analog single-input/single-output PID (Proportional, Integral, and 
Derivative) controller. This simple kind of controller is commonly used in practice. The analog sensor reading y(t) gives 
the measured state of the plant at time t. Also, let e(t) = r(t)— y(t) denote the difference between the desired state r(t) and 
the measured state y(t) at time t. The output u(t) of the controller consists of three terms: a term that is proportional to e(t), 
a term that is proportional to the integral of e(t) and a term that is proportional to the derivative of e(t). (Liu 2) 



a digital controller 


High-level Controls 

The controllers in a complex monitor and control system are often organized in a hierarchy. There can be multiple control 
loops and the high-level controller interfaces with the operator and monitors the behavior of low-level controllers. The 
output of high-level controller acts as the reference input of low-level controller. The time scale and the complexity of 
decision making increases as we group the hierarchy. One or more low-level digital controller directly control the physical 
plant. The periods of low-level control law computation is millisecond to second level and is simple while the periods of 
high-level control can be from few minutes to many and is complex. When we move from low level to high level, the 
system moves from direct control to the planning. 
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fig: an air traffic control hierarchy 


Guidance and Control 

While a digital controller deals with some dynamical behavior of the physical plant, a second level controller typically 
performs guidance and path planning functions to achieve a higher level goal. In particular, it tries to find one of the most 
desirable trajectories among all trajectories that meet the constraints of the system. The trajectory is most desirable 
because it optimizes some cost function. The algorithm used for this purpose is the solution of some constrained 
optimization problem. 

For an example, look again at a flight management system. The constraints that must be satisfied by the chosen flight path 
include the ones imposed by the characteristics of the aircraft, such as the maximum and minimum allowed cruise speeds 
and decent/accent rates, as well as constraints imposed by external factors, such as the ground track and altitude profile 
specifiAAAA* ed by the ATC system and weather conditions. A cost function is fuel consumption: A most desirable 
flight path is a most fuel efficient among all paths that meet all the constraints and will bring the aircraft to the next 
metering fix at the assigned arrival time. This problem is known as the constrained fixed-time, minimum-fuel problem. 
When the flight is late, the flight management system may try to bring the aircraft to the next metering fix in the shortest 
time. In that case, it will use an algorithm that solves the time-optimal problem. 

Real-Time Command and Control 

The controller at the highest level of control hierarchy is called command and control. For example, air traffic control 
(ATC) system architecture shown in the figure monitors the air crafts in its coverage environment and generates and 
provides the information needed by the operator. The output information from ATC system includes the assigned arrival 
times to the metering fixes and these inputs acts as on board flight management system called physical plant. Therefore, 
the ATC system indirectly controls the embedded components of low levels of control hierarchy. Similarly, the ATC 
system provides voice and telemetry services to onboard avionics. Therefore, the commands and control fascinates both 
high level and low-level control. 

The ATC System collects information about the state of the aircrafts through the sensors of radars. The state variables 
include identifier, position, altitude, heading, distance, speed and so on. These variable collectively are called track record 
and the current trajectory of the aircraft is a track. The information collected is processed by the DSPs and is stored in the 
database. The stored data is further processed by DPs and displayed by the display on. The surveillance system 
continuously analyzes the scenario and alerts the operator whenever it detects any potential hazards. 
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Real Time Application 


l. Digital controllers make three assumptions: 


1. sensor data gives accurate estimates of the state - variables being being monitered and controlled - noiseless 

2. the sensor data gives the state of the plant - usually most compute plant state from measured value 

3. all the parameters representing the dynamic of the plant are known. 

2. When we move from low level to high level, the system moves from direct control to the planning. 


3. The controller at the highest level of control hierarchy is called command and control. 


1.1 Signal Processing 

A real time signal processing applications compute in each sampling period one or more outputs. Each output x(k) is a 
weighted sum of n inputs y(i). That is, 
n 

jE(*) = £a(*,/)>(i) 

i'=l 

The weight a(k, i) are generally known and fixed. This computation transforms the given representation of an object such 
as voice or radar signal in terms of the input y(i), into another representation in terms of the outputs x(k)’s. 

(Low level controller workload is purely or mostly periodic) 

Radar System 

A signal processing application is typically a part of a larger system. As an example, Figure below shows a block diagram 
of a passive radar signal processing and tracking system. The system consists of an I/O subsystem that samples and 
digitizes the echo signal from the radar and then, places the sampled values in a shared memory. An array of digital signal 
processors processes these sampled values. The data thus produced are analyzed by one or more data processors that not 
only interface with the display system, but also generate commands to control the radar and select parameters to be used 
by signal processors in the next cycle of data collection and analysis. 

Radar Signal Processing 

To search for objects of interest in its coverage area, the radar scans the area by pointing its antenna in one direction at a 
time. During the time the antenna dwells in a direction, it first sends a short radio frequency pulse. It then collects and 
examines the echo signal returning to the antenna. 
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fig: radar signal 


The echo signal consists solely of background noise if the transmitted pulse does not hit any object. On the other hand, if 
there is a reflective object (e.g., an airplane or storm cloud) at a distance x meters from the antenna, the echo signal 
reflected by the object returns to the antenna at approximately 2x/c seconds after the transmitted pulse, where c =3x108 
meters per second is the speed of light. The echo signal collected at this time should be stronger than when there is no 
reflected signal. If the object is moving, the frequency of the reflected signal is no longer equal to that of the transmitted 
pulse. The amount of frequency shift (called Doppler shift) is proportional to the velocity of the object. Therefore, by 
examining the strength and frequency spectrum of the echo signal, the system can determine whether there are objects in 
the direction pointed at by the antenna and if there are objects, what their positions and velocities are. (Liu 15) 

Gating and Association 

Gating and association is used to remove false trace and trajectory. Strong noise and man-made interferences can make 
signal processing and detection process wrong about the presence of objects a process wrong about the presence of 
objects. A track record on a non- existing object is called a false return. An application that examines all the track records 
in order to sort out the false returns from real ones and update the trajectories of detected objects is called a tracker. The 
tracker assigns each measured value to a trajectory. If the trajectory is of existing one, the measured value assigned to it 
gives the current position and velocity of the object moving along the trajectory but if the trajectory is new one, the 
measured value gives the position and velocity of possible new objects. The tracking of the object is performed in two 
steps: Gating and Association. 

Gating 

Gating is the process of putting each measured value into one of two categories depending on whether it can or cannot be 
tentatively assigned to one or more established trajectories. The value is assigned to an established trajectory if it is within 
a threshold distance ‘G’ from the predicted current position and velocity of the object moving along the trajectory. The 
threshold ‘G’ is called track gate. 

Association 

The tracking process completes if after gating every measured value is assigned to at most one trajectory and every 
trajectory is assigned to at most one measured value. This can occur when radar signal is strong and interferences is low 
and the density of object is low. Sometimes, the result of gating can be confusing and some measured values are assigned 
to more than one trajectory is assigned to more than one measured value. In this case, data association is required to 
complete the assignment and resolve the ambiguity. 
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The data association algorithm given below can be used to remove the ambiguity. 

1. Examine the tentative assignments produced by the gating step. 

2. For each trajectory that is tentatively assigned a single unique measured value, assign the measured value to the trajectory. 
Discard from further examination the trajectory and the measured value, together with all tentative assignments involving 
them. 

3. For each measured value that is tentatively assigned to a single trajectory, discard the tentative assignments of those 
measured values that are tentatively assigned to this trajectory if the values are also assigned to some other trajectories. 

4. Sort the remaining tentative assignments in order of non-decreasing distance. 

5. Assign the measured value given by the fiAAA* rst tentative assignment in the list to the corresponding trajectory and 
discard the measured value and trajectory. 

6. Repeat step (3) until the list of tentative assignments is empty. 


Other Real Time Applications 

The two most common real-time applications are real-time analysis and multimedia applications. These are described 
below. 


Applications 

Size 

Resp. Time 

Resp. Time 

Abs. Cons. 

Rel. Cons. 

Permanence 

Air traffic control 

20,000 

0.50 ms 

5.00 ms 

3.00 sec. 

6.00 sec. 

12 hours 

Aircraft mission 

3,000 

0.05 ms 

1.00 ms 

0.05 sec. 

0.20 sec. 

4 hours 

Spacecraft control 

5,000 

0.05 ms 

1.00 ms 

0.20 sec. 

1.00 sec. 

25 years 

Process control 


0.80 ms 

5.00 sec 

1.00 sec. 

2.00 sec 

24 hours 


Table - Requirements 

of typical real-time databases 

Real Time Databases 

The term real-time data base systems refers to a diverse spectrum of information systems that ranges from stock price 
quotation systems, to track records databases, to real-time file systems. In the table shown above, lists several examples. 
The perishable nature of the data maintained by the databases distinguish them from non-real time databases. Specifically, 
a real-time database contains data objects, called image objects, which represent real-world objects. The attributes of an 
image object are those of the represented real world object. As an example, an air traffic control database contains image 
objects which represent aircraft in the coverage area. The attributes of that kind of image object include the position and 
heading of the aircraft. The values of these attributes are updated periodically based on the measured values of the actual 
position and heading. The values of the actual position and headings are provided by the radar system. Without this 
update, the stored position and heading will deviate more and more from the actual position and heading. In away, the 
quality of stored data degrades. This is reason, real-time data are said to be perishable. In contrast, an underlying 
assumption of non-real-time databases (for example, a payroll database) is that in the absence of updates the data 
contained in them remain good (that is., the database remains in some consistent state satisfying all the data integrity 
constraints of the database).(Liu 19-21) 

Absolute Temporal Consistency: A set of data objects is said to be absolutely (temporally) consistent if the maximum 
age of the objects in the set is no greater than a certain threshold. 

Relative Temporal Consistency: A set of data objects is said to be relatively consistent if the maximum difference in 
ages of the objects in the set is no greater than the relative consistency threshold used by the application. 


Multimedia Applications 

Multimedia is the most frequently encountered real-time applications. A multimedia application may process, store, 
transmit, and display any number of video streams, audio streams, images, graphics, and text. A video stream is a 
sequence of data frames which encodes a video. An audio stream encodes a voice, sound, or music. Without compression, 
the storage space and transmission bandwidth required by a video are enormous. 
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(For example, consider a small 100x100-pixel, 30-frames/second color video. The sample values of a luminance and two 
chrominance signal components gives the intensity and color of each pixel, respectively, at the location of the pixel. If they 
are not compressed, the video requires a transmission bandwidth of 2.7 Mbits per second when the value of each 
component at each pixel is encoded with 3 bits.) Therefore, a video stream and the associated audio stream, is invariably 
compressed as soon as it is captured. (Liu 22) 


Signal Processing 


1. The radar scans the area by pointing its antenna in one direction at a time to search for objects of interest in its coverage 


2. During the time the antenna dwells in a direction, it first sends a short radio frequency pulse. It then collects and examines 
the echo signal returning to the antenna. 

3. Gating is the process of putting each measured value into one of two categories depending on whether it can or cannot be 
tentatively assigned to one or more established trajectories. 

4. The two most common real-time applications are real-time analysis and multimedia applications. 

5. Real-time data base systems refers to a diverse spectrum of information systems that ranges from stock price quotation 
systems, to track records databases, to real-time file systems. 

6. A multimedia application may process, store, transmit, and display any number of video streams, audio streams, images, 
graphics, and text. 
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2.1 Real Time System 

The real-time system covers a broad spectrum of automated platform in which the correctness of the system only requires 
or logical correct option but also produces the result within the pre-specified real time constraints. 

The non-real-time system is the one for which there is no deadline, even if fast response or high performance is required. 
The real-time system is usually situated in an environment and involved sensing devices to detect, control and adapt to the 
environmental conditions. The real time system can be networked or distributed or embedded. 

Jobs and Processors 

A job executes or is executed by the operating system on a processor and may depend on some resources. A job is a unit 
of work that is scheduled and executed by a system (slideplayer). For example, transmission of data package or retrieval of 
a file. 

A processor is an active component on which the jobs are scheduled. For example, the data scheduled on a transmission 
link, read-write request scheduled on a disk, etc. Each processor has a speed attribute which determines the rate of 
progress a job makes toward completion. Two processors are of same type if they are functionally identical and can be 
used interchangeably. 

A task is a set of related jobs which jointly provide some function. For example, the set of jobs that constitute the 
“Maintain, Constant, and Altitude” task keeping an aeroplane flying at a constant altitude. 

Execution time 

The amount of time required to complete the execution of the job Ji when it executes independently and has all the 
resources it needs. The value of execution time c,depends upon the complexity of job and speed up the processor on which 
it is scheduled. The execution time falls into an interval [er, es + ]. 

Release time and Response time 

The release time is the instant time when a job becomes available for execution. It may not be exact due to release time 
Jitter which is the interval [ri-, ri+], A job can be scheduled and executed at any time after its release time provided that its 
resource dependency conditions are made. 

The response time is the length of time from the release time of the job to the time instant when it completes. The response 
time may not be the same as execution time because the job may not execute continuously. 



The constraints imposed on the timing behaviour of the job is called timing constraint. It can be expressed in terms of 
response time and relative or absolute deadlines. 

The instant of the time at which a job completes execution is called completion time. The maximum time allowed to the 
job for its job response time is called relative deadline (Di). The instant of time for which a job is required to be completed 
is called absolute deadline or simply a deadline. A job must be completed by the release time of subsequent job. 
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The feasible interval for a job, Ji is the interval between the release time and absolute deadline i.e. [ri, di]. 



Suppose each job must complete before release of next job. 


• It’s relative deadline is 100ms ms. 

• It’s absolute deadline is 20 + ((i + 1) * 100) ms. 

References 
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Real Time System 

• Note 


[ Things To Remember] 


1. A job is a unit of work that scheduled and executed by a system. 

2. A task is set of related jobs. 

3. A processor is an active component on which the jobs are scheduled. 

4. Execution time is the amount of time required to execute job Ji. 

5. the instant in time when a job becomes available for execution is called release time. 

6. The response time is length of time from the release time of the job to the instant when it completes. 
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2.2 Hard vs Soft Real Time Systems 


Processor and Resources 

A job executes or is executed by the operating system on a processor and may depend on some resources. A processor is 
an active component on which the jobs are scheduled. For example, the data scheduled on a transmission link, read-write 
request scheduled on a disk, etc. Each processor has a speed attribute which determines the rate of progress a job makes 
toward completion. Two processors are of same type if they are functionally identical and can be used interchangeably. 

A resource is a passive component upon which jobs may depend on. For example, memory, database lock, etc. The 
resources have different types and sizes but do not have a speed attribute. They are usually reusable and are not consumed 
by use. 

If the system contains p different types of resources, then it means 

1. There are p different types of serially reusable resources. 

2. There are one or more units of each type of resource and only one job can use each unit at once. 

3. A job must obtain a unit of needed resource, use it and then, release it. 

A resource is plentiful if no job is ever prevented from executing by the unavailability of the resource. The job never block 
when attempting to obtain a unit of a plentiful resource. 

Hard timing constraints and soft timing constraints 

Timing constraints can be divided into two types: Hard and Soft. There are numerous definitions of hard and soft timing 
constraints. 

A timing constraint is hard if the failure to meet it is considered a fatal error and the late completion becomes disastrous. 
This definition is based on the functional criticality of the job. A timing constraint is soft if the usefulness of the result falls 
off abruptly or may even become negative at the deadline. In this case, we may need validation of the system to confirm 
that the system always needs timing constraints. A hard deadline is imposed on a job because a late result produced by the 
job after the deadline may have disastrous consequences. (For example, a late command to stop a train may cause a 
collision, and a bomb dropped too late may hit a civilian population instead of the intended military target.) 

If some deadlines can be missed occasionally with low probability then the system is called soft real-time system. In this 
case, the deadline is called soft deadline and few misses of soft deadline to know serious damage, but the performance 
becomes poorer and poorer when more and more jobs with soft deadline complete late. 

Hard Timing Constraints and Temporal Quality-of-Service Guarantees 

The timing constraint of a job is hard timing constraint, and the job is a hard real-time job, if the user requires the 
validation that the system always meet the timing constraint. By validation, we mean a demonstration by a provably 
efficient and correct procedure or by exhaustive simulation and testing. 

On the other hand, if no validation is required, or only a demonstration that the job meet some statistical constraint (that is, 
a timing constraint spccii'-iAAAAAAA* ed in terms of statistical averages) is enough, then the timing constraint of the job 
is soft. The satisfaction of statistical constraints (for example, the average number of missed deadlines per minute is two or 
less) can usually be demonstrated with a performance pro^-'AAAAAAA• le somewhat more thorough than those that are 
used to demonstrate the performance of general interactive systems. 

It can be stated in another way as, if the user wants the temporal quality (for example, response time and jitter) of the 
service provided by a task are guaranteed and the satisfaction of the timing constraints dcHAAAAAAA* ning the 
temporal(Liu 29) quality are validated, then the timing constraints are hard. On the other hand, if the user wants the best 
quality of service that the system can provide but allows the system to deliver qualities only below what is 
dcHAAAAAAA* ned by the timing constraints, then the timing constraints are soft. 
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Hard Real-Time System 

If a job must never misses its deadline then the system is called hard real-time system. For a hard real-time system, every 
deadline must be hit. In a real hard real-time system, if the system fails to hit the deadline even once the system is said to 
have failed. 

A hard real-time system is also known as an immediate real-time system. It is a hardware or software that must operate 
within the confines of a stringent deadline. The application is considered to have failed if it does not complete its function 
within the given allocated time span. Some examples of hard real-time systems are medical application like pacemakers, 
aircraft control systems and anti-lock brakes. 

• The hard real-time system is called guaranteed services. 

• Response time is hard 

• Often safety critical 

Soft Real Time system 

If some deadlines can be missed occasionally acceptably with low probability then the system is called soft real time 
system. In a soft real time system, even if the system fails to meet the deadline one or more than once, the system is still 
not considered to have failed. For example, streaming audio-video. 


The soft real-time system i 
Response time is soft 
Non-critical 

s called best effort service. 


Item 

Hard 

Soft 

size of data files 

small and medium 

large 

peal load performance 

predictable 

degraded 

data integrity 

shortterm 

longterm 

error detection 

autonomous 

user supported 

control of pace 

environment 

copmuter based 
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Hard vs Soft Real Time Systems 


| Things To Remember | 


A processor is an active component on which the jobs are scheduled. E.g, the data scheduled on a transmission link. 

A resource is a passive component upon which jobs may depend on. For example, memory. 

Hard Real Time System - job never misses deadline. 

Soft Real Time System - job occasionally misses deadlines. 

A timing constraint is hard if the failure meet it is considered fatal error. 

A timing constraint is soft if the usefulness of the result falls off abruptly or may even become negative at the deadline. 
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3.1 Reference Model of RTS 


Reference Model of Real Time System 

The model that focuses on the relevant characteristic such as timing properties , resource requirement of the system 
components and the way in which the operating system allocates the available system resources among them is called 
reference model of RTS. Its main goal is to abstract away from the functional characteristics and focus on the living 
properties and resource requirement. According to this model, each system is characterized by three elements. 

1. A workload model that describes the application supported by the system. 

2. A resource model that describes the system resources available to the application. 

3. Algorithms that define how the application system uses the resources at all the times. 

Each job is characterized by 

1. Temporal Parameters 

2. Interconnection Parameters 

3. Resource Parameters 

4. Functional Parameters 

Temporal Parameters 

The temporal parameter of a job is the parameter which describes its timing constraints and behavior such as release time 
(n), absolute deadline (&), relative deadline (DO, execution time (e,) and feasible interval [r,, d,]. 


Fixed, Jitter and Sporadic Release Time 

Release time may exactly may not be known, only that ri is in a range [rr, rr|. ri can be as earliest as ri- and as late as latest 
release time ri+. When only the range of ri is known, then this range is called jitter in ri or the release time Jitter. When 
the jitter is negligible compared to other temporal parameters, then the actual release time of each job is given by its 
earliest or latest release time then we can call it a fixed release time. 

Every real time system is required to respond to unexpected external events which occurs at random instance of time. 
During such events the system executes a set of jobs in response. The release times of these jobs are known until the 
events triggering them occur. These jobs are called sporadic jobs or aperiodic jobs because they are released at random 
instant of time. Though both of these jobs occur at random instant, the sporadic jobs have hard timing constraints and 
aperiodic jobs have soft timing constraints. 


Periodic Task Model 

It is a deterministic work load model used to describe hard real time application such as digital control and real time 
monitoring system. There are two methods and tools to support the design analysis and validation of real time systems 
supported by basic model. 

The period, execution time and the phases of the task play important role in this model. When each computation or data 
transmission is executed repeatedly at regular intervals or semi-regular intervals in order to provide a function of the 
system on a continuous basis, then the task is called a Periodic Task. The periodic task T is a sequence of jobs ji. When 
there are more than one task in the system then hyperbolic is considered as one of the important parameter. The hyperbolic 

of a set of periodic task is the L.C.M. of their periods Pi for i = 1, 2, 3,., n. The maximum number N of jobs in 

each hyper period is equal to 

EL, h/p, 


Where H = hyper period 

For example, the length of hyper period of the periodic task with periods with 3, 4 and 10 is 60 and total number of jobs in 
the hype period is 

[60/3 + 60/4 + 60/10] = 20 + 15 + 6 = 41 
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The period Pi of periodic task is the minimum length of all time intervals between released times of consecutive jobs in a 
task and execution time is the maximum execution time of all the jobs. The released time r u of the first job Ju, in each task 
ti is called phase of the task (cp). Therefore, cp = r u . 

The ratio of the execution time to the period of the job is called utilization i.e. ui = ei/p>. 

It is equal to the fraction of time, a truly periodic task with period Pi and execution time ei. It keeps the processor busy. It 
gives an upper bound to the utilization of any task ti. The total utilization of all the tasks in the system is the sum of the 
utilizations of the individual tasks in it. Therefore, Ui =X Ui, (n , i = 1). 


Reference Model of RTS 


1. According to Reference model, each system is characterized by three elements. 


1. A workload model that describes the application supported by the system. 

2. A resource model that describes the system resources available to the application. 

3. Algorithms that define how the application system uses the resources at all the times. 

2. Temporal Parameter describes its timing constraints and behavior such as release time (n), absolute deadline (d), 
relative deadline (DO, execution time (&) and feasible interval |r,, d,|. 

3. When only the range of r, is known, then this range is called jitter in r, or the release time Jitter. 
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3.2 Precedence and Data Dependencies 


Precedence Constraints and Data Dependency 

The jobs in a task may be constrain to execute in a particular order and it is known as precedence constraint. The partial 
order relation, < is used to specify the precedence constraint among the jobs and is called precedence relation. A job Tis a 
precedence of another J k (J k is a successor of J k ) if J k cannot begin until the execution of T completes. It is denoted as T < J k 
In this case, Ji is an immediate predecessor of J k if J, < J k if there is no other job in between. Tand J k are said to be 
independent when neither of T < J k nor J k < I. It means they can execute in any order. A job with precedence constraint 
becomes ready for execution once when its release time has passed and when all the predecessors have completed. 

Precedence Graph and Task Graph 
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fig: 1 Examples of Task 


We can represent the precedence constraints among the jobs in a set J using a directed graph G = (J, <). Each node 
represents a job, a directed edge from Ji to J k if J, is an immediate predecessor of J k . This graph is called precedence graph. 
Many types of interactions and communications are not captured by a task graph. 

A task graph is an extended precedence graph, the vertices in the task graph represents jobs, number in brackets above 
each job represents feasible interval and the edges represent dependencies among the jobs but if all the edges are 
precedence edges representing precedence constraints then the task graph also becomes a precedence graph. 

The top row in the graph represents independent jobs as there is no edge to or from these jobs. It means the jobs are ready 
for execution once released even though other jobs released earlier are not complete. 

The precedence graph and task graph also contain different types of edges that represents different types of dependencies. 
The type of dependencies represented by an edge is given by the type of edge. 

Data Dependency 

The data dependency can’t be captured by the precedence graph. In the real time system, the jobs communicate through 
shared data and the producer and consumer jobs are not synchronized. The producer places the data generated by it in a 
shared address space to be used by the consumer at any time. The jobs are not constraints to be executed in order. In a task 
graph, the data dependencies among the jobs are represented explicitly by the data dependency edges among the jobs. 
There is a data dependency edge from a vertex Ji to J k if the job J k consumes data generated by Ji. The parameter of an edge 
from T to J k represents volume of data from Ji to J k and is called data volume parameter. 

Other Dependency 

Temporal Dependency 

Sometimes it is constraint to complete some jobs within a certain amount of time relative to one another. The difference in 
the completion time of two jobs is called temporary distance between them, jobs are said to be temporal distance 
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constraint if their temporal distance must be not more than some finite value. Such jobs may or may not have deadlines. 
Temporal dependencies edges from vertex L to J k if the job J k must be completed within a certain time after J, completes. 
The temporal distance between jobs is given by temporal distance parameter of the edge, the value of this parameter is 
infinite if the jobs have no temporal distance constraint or no temporal dependencies as between jobs. 

AND/OR Precedence Graph 

Normally, a job must wait for the completion of all the immediate predecessors and it is called AND precedence 
constraints and the jobs are called AND jobs. It is represented by unfilled circle in the task graph. 

OR constraints indicate that a job may begin at or after its release time if only one or some of the immediate predecessors 
have completed. It is represented by unfilled square in the task graph. 

The line joining two filled circles are represents the conditional block and dotted edge represents the producer consumer 
relationship in the task graph. 

Conditional Branch 

In the classical model, all the immediate successors of a job must be executed; an outgoing edge from every vertex 
expresses an AND constraint. This convention makes it inconvenient for us to represent conditional execution of jobs. 

This system can easily be modeled by a task graph that has edges expressing OR constraints. Only one of all the 
immediate successors of a job whose outgoing edges express OR constraints is to be executed. Such a job is called a 
branch job. In a meaningful task graph, there is a join job associated with each branch job. In Figure 1, these jobs are 
represented by i-iAAAAAAA* lied circles. The subgraph that begins from a vertex representing a branch job and ends at 
the vertex representing the associated join job is called a conditional block. Only one conditional branch in each 
conditional block is to be executed. The conditional block in Figure 1 has two conditional branches: Either the upper 
conditional branch, containing a chain of jobs, or the lower conditional branch, containing only one job, is to be executed. 
(Liu 46) 

Pipeline Relationship 

A dependency between a pair of producer-consumer jobs that are pipelined can theoretically be represented by a 
precedence graph. In this graph, the vertices are the granules of the producer and the consumer. Each granule of the 
consumer can begin execution when the previous granule of this job and the corresponding granule of the producer job 
have completed. Problem 3.3 gives an example of how this representation works, but the example also illustrates how 
inconvenient the representation is. For this reason, we introduce the pipeline relation between jobs. In the task graph, we 
represent a pipeline relationship between jobs by a pipeline edge, as exemphi-iAAAAAAA* ed by the dotted edge 
between the jobs in the right-bottom comer of the graph in Figure 1 .There is an edge from L to k if the output of J, is piped 
into J k and the execution of J k can proceed as long as there are data for it to process. (Liu 47) 


Precedence and Data Dependencies 


The jobs in a task may be constrain to execute in a particular order and it is known as precedence constraint. 

Precedence constraints among the jobs in a set J can be represented using a directed graph G = (J, <) which is called 
Precedence Graph. 

A task graph is an extended precedence graph. 

In a task graph, the data dependencies among the jobs are represented explicitly by the data dependency edges among the 
jobs. 

Other Dependencies : Temporal dependencies, AND/OR precedence graph, conditional branch, pipeline relationship. 
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3.3 Functional Parameters 


FUNCTIONAL PARAMETERS 

While scheduling and resource access control decisions are made disregarding most functional characteristics of jobs, 
several functional properties can affect these decisions. The workload model must explicitly describe these relevant 
properties which is done by the values of functional parameters. 

These includes: 

1. Preemptivity of jobs, 

2. Criticality of jobs, 

3. Optional execution, and 

4. Laxity type and laxity function. 

5. 

Preemptivity of Jobs 

Preemptivity of jobs means whether a currently executing job can be momentarily stopped and a new job having hyper 
priority can be executed or not. One of the higher priority job completes execution, the preemptive jobs resumes 
execution. During this process, currently executing job is stored in the memory and the job with higher priority is brought 
into the processor. This process is called context switch and the time required to perform context switch is called 
switching time. 

Criticality of Jobs 

All the jobs may not be equally important. The importance (criticality) of a job is represented by a positive number that 
indicates how critical the job is with respect to other jobs. The more critical the jobs, the larger its importance. Priority and 
weight are used to indicate the importance. The more important a job, the higher it’s priority or the larger its weight. 

During overload, less critical jobs are sacrificed so that critical jobs meet their deadline. Therefore, weighted performance 
measures such as weighted average response time or weighted average tardiness is used and optimized by scheduling and 
resource control algorithms. 

Optional Execution 

Optional means not mandatory. Therefore, optional job or optional portion of a job can be discarded even when executing. 
But the not optional or mandatory jobs must be executed to completion. The optional jobs are not critical and can be 
sacrificed to stop system degradation. If an optional job completes late or is not executed at all, the system performance 
may degrade but functions satisfactorily. 
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Laxity Type and Laxity Function 

Laxity type of a job means whether its timing constraints are hard or soft. Laxity type is supplemented by a usefulness 
function. This function gives the usefulness of the result produced by the job as a function of its tardiness. The tardiness 
the difference between the completion time and deadline of the job. 


• tardiness 


Examples of usefulness functions 

In the graph, the dark lines are called hard real time jobs. In this case, the result becomes zero or negative as soon as the 
job is tardy. Therefore, in the second case, it is better not to execute the job for example release of bomb by a fighter plane. 

In the graph, the dotted lines shows that the usefulness decreases gradually and becomes zero. For example, withdrawing 
money from ATM machines. 

The dashed line is a function that decreases faster and becomes negative. Such cases can occur while obtaining the stock 
price. 

The usefulness function is used to describe qualitatively the real time performance objective of the system. It can be 
helpful while choosing and implementing scheduling strategies. The usefulness means to specify the timing constraints for 
hard real time system and it should be small. 



Resource Parameter of Jobs and Parameter of Resources 

Resource parameter gives the resource requirements. Basic component available for an application are processors and 
resources. Processor is required throughout its execution. Similarly, it requires other resources. The resource parameter of 
a job gives the type of processor, the unit of each resource type required and the time intervals during its execution when 
the units are required. The information provided by resource parameter is used to support the resource management 
decision. 

Preemptivity of resource 


Resource parameters are used to view the processors and the resources from the perspective of the applications that 
execute on them. One of the resource parameters is preemptivity. A resource is non-preemptable if each unit of the 
resource is constrained to be used serially. Once a unit of resource is allocated to a job, other jobs needing it must wait 
until the job completes its use. 

If a job can use every unit of resource in a interleaved fashion, the resource is called preemptable. For example, the lock in 
a database and token ring are non-preemptable, and sending message using ATM is preemptable. 
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Resource Graph 

A graph that is used to describe the configuration of the resources is called resource graph. It gives the amount of 
resources available to execute the application, attribute of resources and rules governing their usage. In a resource graph, 
there is a vertex ft, for every processor or resource Ri in the system. The attributes of the vertex are the parameters of the 
resource. The resource type of the resource gives whether the resource is a processor or (passive) resource, and its number 
gives the number of available units. Edges in a resource graph represent the relationship among resources. Different types 
of edges are used to describe different configuration of the system. 

There are two types of edges. An edge from vertex Ri to R k means R k is a component of Ri. This edge is called is-a-port-of- 
edges. Subgraph containing all the is-a-part-of-edges is a major component which contains sub-components represented by 
vertices in the tree. 

The edges that represent connectivity between components are called accessibility edges. E.g. if there is a connection 
between two CPUs in two computers, then each CPU is accessible from other computer and there is an accessibility edge 
from each component to CPU on other computer as shown in model of RTS in above figure. Each accessibility edge may 
have several parameters whose value help to complete the description of interconnection of the resources. E.g. cost of 
sending a unit of data from a job executing on Pi to a job executing on P k can be represented by a parameter of an 
accessibility edge from Pi to P k . 


Scheduling hierarchy, Scheduler and Schedule 

Jobs are scheduled and allocated resources according to a chosen set of scheduling algorithms and resource access control 
protocols. The scheduler is a module that implements these algorithms. It specifically assigns jobs to processors. The total 
amount of time assigned to a schedule is the total length of all the time intervals during which the job is scheduled on a 
processor. A schedule is an assignment of all jobs in the system on the available processors. 

The task graph at the top, resource graph at the bottom and in between them scheduling algorithm and resource access 
control protocol used by an operating system is called scheduling algorithm. Therefore, it gives information about the 
precedence constraints. Functional and resource parameters of a job, and interdependence of processors as shown in the 
figure below. 



.. . .._ Model of real-time systems 

If a scheduler produces only valid schedules then it is called correctness of a schedule. A valid schedule is the one that 
satisfies the following conditions. 

• Every processor is assigned at most one job at any time. 

• Every job is assigned at most one processor at any time. 

• No job is scheduled before its release time. 

• The total amount of processor time assigned to every job is equal to its maximum or actual execution time. 

• All the precedence and resource usage constraints are satisfied. 

The assumptions here is that jobs do not ran in parallel on more than one processor to speed up their execution. 
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Feasibility, Optimality and Performance measures of Scheduling 

A valid schedule is also a feasible schedule if every job meets its timing constraints or completes by its deadline. A job is a 
schedulable according to a scheduling algorithm, if the scheduler always produces a feasible schedule. 

Miss Rate: The percentage of jobs that are executed but completed too late. 

Loss Rate: The percentage of jobs that are not executed at all or discarded. 

Loss rate cab be minimized provided the miss rate is below some threshold. A performance measure that captures this 
tradeoff is called invalid rate. It is sum of the miss rate and loss rate and gives percentage of all the jobs that do not 
produce a useful record. It should be as minimal as possible in order to make the soft real time system optimal. 

A hard real time system is optimal if the algorithm always produces a feasible schedule when the given set of jobs has 
feasible schedule. If an optimal algorithm fails to find a feasible schedule then the given sets of jobs can’t feasibly be 
scheduled by any algorithm. 

The other performance measures are maximum and average tardiness, response time, miss/loss and invalid rates. 

Lateness of job is the difference between its completion time and its deadline. It is negative if the job completes early, 
positive if completes late. The tardiness can never be negative though it is also a difference between its completion time 
and its deadline. To keep jitters in completion time small, use scheduling algorithms which minimizes the average absolute 
lateness of jobs. 

Scheduling jobs with same release time and deadline means scheduling to minimize the completion time of the job which 
completes last among all the jobs. The response time of this job is the response time of the set of jobs as a whole and is 
often called makespan of the schedule. Makespan is used to compare scheduling algorithms and the algorithm with shorter 
makespan is better. If the makespan is less than or equal to the length of their feasible interval, the job can meet their 
deadline. 


Functional Parameters 


| Things To Remember| 


The workload model must explicitly describe these relevant properties which is done by the values of functional 
parameters. 

Preemptivity of jobs - whether a job can be momentarily stopped and a new job with higher priority can be executed or 
not. 

Criticality of Jobs - all jobs may not be equally important, 
optional job can be discarded even when executing. 

Laxity type of a job means whether its timing constraints are hard or soft. 

Resource parameter gives the resource requirements. 

A graph that is used to describe the configuration of the resources is called resource graph. It gives the amount of 
resources available to execute the application, attribute of resources and rules governing their usage. 

If a scheduler produces only valid schedules then it is called correctness of a schedule. 
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4.1 Clock Driven Approach 


Clock-Driven Approach 

As the name suggests, when scheduling is clock-driven (also called time-driven), decisions on what jobs execute at what 
times are made at specific time instants. These instants are chosen a priori before the execution of the system begins. 
Typically, in a system that uses clock-driven scheduling, all the parameters of hard real-time jobs are fixed and known. A 
schedule of the jobs is computed off-line and is stored for use at run time. At each scheduling decision time, the scheduler 
schedules the jobs according to this schedule. This way, scheduling overhead during run-time can be minimized. 

A frequently adopted choice is to make scheduling decisions at regularly spaced time instants. One way to implement a 
scheduler that makes scheduling decisions periodically is to use a hardware timer. The timer is set to expire periodically 
without the intervention of the scheduler. When the system is initialized, the scheduler selects and schedules the job(s) that 
will execute until the next scheduling decision time and then blocks itself waiting for the expiration of the timer. When the 
timer expires, the scheduler awakes and repeats these actions. (Liu 60) 

Weighted Round Robin Scheduling 

The round-robin approach is normally used for scheduling time-shared applications. When jobs are scheduled on a round- 
robin basis, every job joins a First-in-First-out, FIFO queue when it becomes ready for execution. The job at the head of 
the queue executes for at most one-time slice. (A time slice is the basic granule of time that is allocated to jobs. In a time 
shared environment, a time slice is typically in the order of tens of milliseconds.) If the job isn’t complete by the end of the 
time slice, it is preempted and is placed at the end of the queue where it waits for its next turn. When there are n number of 
ready jobs in the queue, each job gets one-time slice every n time slices, i.e., every round. Because the length of the time 
slice is short, the execution of every job begins immediately after it becomes ready. Each job gets 1/n th share of the 
processor when there are n jobs ready for execution. Due to this reason, the round-robin algorithm is also known as the 
processor-sharing algorithm. 

The weighted round-robin algorithm has been used for scheduling real-time trafic in high-speed switched networks. It is 
built on the basic round-robin scheme. Rather than giving all the ready jobs equal shares of the processor, different jobs 
may be given different weights. Here, the weight of a job means the fraction of processor time allocated to the job. 
Specifically, a job with weight wt gets wt time slices every round, and the length of a round is equal to the sum of the 
weights of all the jobs ready. We can speed up or retard the progress of each job toward its completion by adjusting the 
weights of jobs. (Liu ,61) 

For example, consider two sets of jobs, J1 ={Ju, Jujand J2 ={ J 2 ,i, J 2 ^}, shown in below. The release times of all jobs are 0, 
and their execution times are 1. L i and J 2 ,i execute on processor PI, andlu and L.zexecute on processor P2. Suppose that 
Ji.i is the predecessor of Ju, and J 2 ,i is the predecessor of J 2 , 2 . 
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scheduling of precedence-constrained jobs 

Priority-driven scheduling 


(b) 


Example illustrating round-robin 


The priority-driven algorithms refers to a large class of scheduling algorithms that never leave any resource idle 
intentionally. It can be stated in another way as, a resource idles only when no job requiring their source is ready for 
execution. Scheduling decisions are made when events such as releases and completions of jobs occur. Therefore, priority- 
driven algorithms are event-driven. 

Other names for this approach are greedy scheduling, list scheduling and work-conserving scheduling. A priority-driven 
algorithm is said to be greedy because it tries to make locally optimal decisions. Leaving a resource idle while some job is 
ready to use the resource is not locally optimal. So, when a processor or resource is available and some job can use it to 
make progress, such an algorithm never makes the job wait. 

The term list scheduling is also descriptive because any priority-driven algorithm can be executed by assigning priorities 
to jobs. Jobs that are ready for execution are placed in one or more queues in the order of the priorities of the jobs. At any 
scheduling decision time, the jobs with the highest priorities are scheduled and executed on the available processors. 
Hence, a priority-driven scheduling algorithm is dcHAAAAAAA* ned to a great extent by the list of priorities it assigns 
to jobs; the priority list and other rules, such as whether preemption is allowed, dci'-'AAAAAAA* ne the scheduling 
algorithm completely. Most scheduling algorithms used in non-real-time systems are priority-driven. The examples 
include the FIFO (First-In-First-Out) and LIFO (Last-In-First-Out) algorithms, that assign priorities to jobs according their 
release times, and the SETF (Shortest-Execution-Time-First) and LETF (Longest-Execution-Time-First) algorithms, that 
assign priorities on the basis of job execution times. Because the priorities of jobs can be dynamically changed, even round 
robin scheduling can be thought of as priority-driven. The priority of the executing a job is lowered to the minimum value 
among all jobs waiting for execution after the job has executed for a time slice. (Liu , 62) 
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Nonpreemptive. 

Dynamic System vs. Static System 

If the jobs are scheduled on multiple processors and a job can be dispatched from the priority run queue to any of the 
processor, then the system is called dynamic. A job can migrate in a dynamic system. The migration occurs if the job starts 
execution on one processor and resumes on a different processors. 

If the jobs are partitioned into subsystems and each subsystem is bound statically to a single processor then the system is 
called static system. There is no possibility of job migration in static system. The static system has inferior performance in 
terms of overall response time relative to dynamic system. But, it is possible to validate static system, whereas there is no 
possibility of such validation in dynamic system. For this reason, most of the hard real time systems are static and soft real 
time system can be dynamic. 


(b) 

Example of priority-driven scheduling, (a) Preemptive (b) 


Clock Driven Approach 


1. Clock Driven Approach 

schedule calculated offline, can use complex algorithms 
applicable to deterministic systems 


since the parameters of all jobs are with hard deadlines 
2. Weighted Round Robin Scheduling 


: known, can construct a singel cyclic schedule in advance 


parameter is known before, so online scheduling can be done 

minimum runtime ahead 

response time of preemptive > non-preemptive 

also called greedy algorithm 

3. Priority Driven Scheduling 


list the priority of diffrent jobs 
FIFO, LIFO, SETF, LETF 
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4.2 Optimality of EDF 


Optimality of EDF 


Theorem 

When preemption is allowed and jobs do not contend for resources, the EDF algorithm can produce a feasible schedule of 
a set J of independent jobs with arbitrary release times and deadlines on a processor if and only if J has feasible schedules. 

Proof: 

We show that any feasible schedule of J can be systematically transformed into an EDF schedule. 

Suppose parts of two jobs Ji and Jk are executed out of EDF order. 



t f 

d k d, 


This situation can be corrected by performing a “swap”: 


If we inductively repeat this procedure, we can eliminate all out-of-order violations. 

The resulting schedule may still fail to be an EDF schedule because it has idle intervals where some job is ready: 


Such idle intervals can be eliminated by moving some jobs forward: 



Corollary 


When preemption is allowed and jobs do not contend for resources, the LRT algorithm can produce a feasible schedule of 
a set J of jobs with arbitrary release times and deadlines on a processor if and only if feasible schedules of J exist. 

Optimality of LST 

When preemption is allowed and jobs do not contend for resources, the LST (MLF) algorithm can produce a feasible 
schedule of a set J of jobs with arbitrary release times and deadlines on a processor if and only if feasible schedules of J 

Jl, 1(0,1) 

J2, 1(0,2) 

J3, 5(0,5) 

ST = deadline - current time - (remaining time to execute the job) 

At, t=0, slacktime (ST), 
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for J1 = 1 -0-(l-0) = 0 

for J2 = 2-0-(l-0)= 1 

for J3 = 5 - 0 - (5 — 0) = 0 

At t = 1, ST for J1 = 1 - 1 - (1 -1) = 0 

J2 = 2- I (1 - 0 ) = 0 

J3 = 5 — 1 - (5- 0) = -1 

Example 

Show that given jobs are LST optimal. 
Jl, 3(0,6), J2, 2(5,8), J3,2(2,7) 
Solution: 

At time t = 0, slack time, ST for 
Jl = 6 - 0 - (3 - 0) = 3 



At time t = 1, ST for 


Jl =6-2—(3 —2) = 3 


J3 = 7 - 1 - (2 - 0) = 4 



At time t = 2, ST for 


Jl = 6 - 2 - (3 - 2) = 3 


J2 = 8 - 2 - (32- 0) = 4 



At time 3 = 0, ST for 
Jl = 6 - 3 - (3 - 3) = 3 
J2 = 8 - 3 - (2 - 0) = 4 


J3 = 7 - 2 - (2 - 0) = 3 
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At time t = 4, ST for 
Jl=6-4-(3-3) = 2 
J2 = 8 - 4 - (2 — 0) = 2 
J3 = 7 - 4 - (2 - 1) = 2 



At time t = 5, ST for 
J1 = 6 - 5 - (3 — 3) = 1 
J2 = 8 — 5 - (2 — 1) = 2 
J3 = 7— 5 — (2 — 1) = 1 



At time t = 6, ST for 
J1 =6-6-(3-0) = 0 
J2 = 8 — 6 — (2 — 1) = 1 



Non- Optimality of EDF and LST 

The system shown in this 'HAAAAAAA* gure has three independent, nonpreemptable jobs Jl, J2, andJ3. Their release 
times are 0, 2 and 4, respectively, and are indicated by the arrows above the schedules. Their execution times are 3, 6, and 
4; and their deadlines are 10, 14, and 12, respectively. 

n r 2 r 3 

I I I 


7 3 misses its deadline 

an EDF schedule 


Here, J3 doesn’t meet deadline. So, it is non-optimal. 
Now, using EDF 


A non-EDF schedule 


It is also non-optimal because it contains slack time. 

The example in Figure below shows that the EDF algorithm is not optimal for scheduling preemptable jobs on more than 
one processor (here, we have two processors, J1>J2>J3). The system in this I-iAAAAAAA* gure also contains three jobs, 
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J1, J2, andJ3. Their execution times are 1, 1, and 5 and their deadlines are 1, 2, and 5, respectively. The release times of all 
three jobs are 0. The system has two processors. 



0 


P 


2 




J 3 misses its deadline 


(a) 


0 1 2 

(b) 


(a)The EDF schedule, (b) A feasible 

schedule 
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Optimality of EDF 


A way to assign priorities to jobs is on the basis of their deadlines. In particular, the earlier the deadline, the higher the 
priority. The priority-driven scheduling algorithm based on this priority assignment is called the Earliest-Deadline-First 
(EDF) algorithm. 


• The algorithm that is optimal for scheduling preemptive jobs on one processor is called the Least-Slack-Time-First (LST) 
algorithm. 

• TheLST algorithm assigns priorities to jobs based on their slacks: the smaller the slack, the higher the priority. 

• The EDF algorithm is not optimal for scheduling preemptable jobs on more than one processor. 
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4.3 Challenges in Validating Timing Constraints in Priority Driven Systems 


Challenges in validating Timing Constraints in Priority Driven System 

The priority driven scheduling has many advantages overclock driven scheduling. It is better suited for applications with 
varying time and resource requirement since it needs less information a priori. This schedule is easy to implement and has 
small runtime overheads when we use simple priority assignment but it is not widely used in hard real-time systems and 
safety critical systems where timing requirements need to be deterministic. It is also difficult to validate whether the 
algorithm needs the deadline or not. This is a problem called validation problem. 

The validation problem states that given a set of jobs, the set of resources available to the jobs and scheduling ot resource 
access control algorithm to allocate processors and resources to the jobs, determined whether all the jobs meet their 
deadlines. 

For example, consider two identical processors PI and P2 and jobs J1, J2, J3 and J4 as shown below. Also, consider that 
the jobs can be preempted but cannot be migrated. 


Jobs (J) 

Ri 

di 

fe, e,+] 

J1 

0 

10 

5 

J2 

0 

10 

[2, 6] 

J3 

4 

15 

3 

J4 

0 

20 

10 


The timing diagram for the given jobs with the execution time J2 is 6. 


0 5 10 15 20 



5 15 


(a) 


(b) 


(C) 


(d) 


Example illustrating scheduling anomalies. 


The best case schedule for J4 when the execution is equal to 5. J4 can execute as early as 15 and its completion time jitter 
exceeds the upper limit of 4. This phenomenon is called scheduling anomalies. 
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The scheduling anomalies indicate that the schedule looks like right but it is not in real. It is an unexpected timing 
behavior of the priority driven system. The scheduling anomalies can also occur for non-preemptive jobs when schedule in 
single processors. 

Offline versus online scheduling 

A clock-driven scheduler simply makes use of a pre-computed schedule of all hard real-time jobs. This schedule is 
computed off-line before the system begins to execute. The computation is based on the knowledge of the release times 
and processor-time or resource requirements of all the jobs for all times. When the operation mode of the system changes, 
the new schedule specifying when each job in the new mode executes is also pre-computed and stored for use. In this case, 
it is said that the scheduling is done off-line, and the pre-computed schedules are off-line schedules. 

The main disadvantage of off-line scheduling is inflexibility. Offline scheduling is possible only when the system is 
deterministic which means that the system provides some V-iAAAAAAA* * xed set(s) of functions and that the release times 
and processor-time or resource demands of all its jobs are known and do not vary or may vary only slightiy. However, for 
a deterministic system, off-line scheduling has many advantages. One of the advantages being the deterministic timing 
behavior of the resultant system. The complexity of the scheduling algorithm(s) used for this purpose is not important 
because the computation of the schedules is done off-line. (Liu, online versus offline scheduling 77-78) 

Competitiveness of On-Line Scheduling. 

It is said that scheduling is done on-line, if the scheduler makes each scheduling decision without having knowledge about 
the jobs which will be released in the future; the parameters of each job become known to the on-line scheduler only after 
the job is released. The priority-driven algorithms are on-line algorithms. The admission of each new task depending on 
the outcome of an acceptance test which is based on the parameters of the new task and tasks that were admitted earlier. 
This type of acceptance test is on-line. So, the future workload of an on-line scheduling is unpredictable. An on-line 
scheduler can accommodate dynamic variations in user demands and resource availability. The price of the flexibility 
and adaptability is a reduced ability for the scheduler to make the best use of system resources. The scheduler cannot make 
optimal scheduling decisions without prior knowledge about future jobs, while a clairvoyant scheduler can that knows 
about all future jobs.(Liu, online versus offline scheduling 78) 
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Challenges in Validating Timing Constraints in Priority Driven Systems 


A clock-driven scheduler makes use of a pre-computed schedule of all hard real-time jobs. The computation is based o 
the knowledge of the release times and processor-time or resource requirements of all the jobs for all times. 


• It is said that scheduling is done on-line, if the scheduler makes each scheduling decision without having knowledge about 
the jobs which will be released in the future. 
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5.1 Clock Driven Scheduling 


Notations and assumptions Assumptions 

Clock-driven system is applicable to deterministic systems. 

A restricted period task model. 

The parameters of all periodic tasks are known as a priori. 

For each mode of operation, system has a fixed number, n periodic tasks. 

For task Ti, each job J,, k is ready for execution at its release time r iJt and is released for Pi units of time after the previous job 
in Ti such that n, k = E. 

Variations in the inter release times of jobs in a periodic task are negligible. 

Aperiodic jobs may exists. 

Assume that the system maintains a single queue to for aperiodic jobs. 

Whenever the processor is available for aperiodic jobs, the job at the head of this queue is executed. 

There are no sporadic jobs. 

Recall: sporadic jobs have hard deadlines, aperiodic jobs do not. 

Notations 

The 4-tuple T = (cpi. Pi, ft, Di) refers to a periodic task n with phase 91, period Pi, execution time e 1; and relative deadline D.. 
Default phase of T is 9 = 0, default relative deadline is the period Di = Pi. Omit elements of the tuple that have default 
values. 

Examples 

T, = (1, 10,3, 6) -> 91 = 1, Pi = 10, ei = 3, Di = 6 

Ju released at 1, deadline 7 

Jureleased at 10, deadline is 17 

T 2 = (10, 3, 6) —> 92 = 0, P 2 = 10, e 2 = 3, D 2 = 6 

Ji.i released at 0, deadline 6 

Ji,2 released at 10, deadline 16 

T 3 = (10, 3) -► 93= 0, P 3 = 10, e 3 = 3, D 3 = 10 

J1.1 released at 0, deadline 10 

Ji, 2 released at 10, deadline 20 

Static, timer driven schedule 

Whenever the parameters of jobs with hard deadlines are known before the system begins to execute, a straightforward 
way to ensure that they will meet their deadlines is to construct a static schedule of the jobs off-line. This schedule 
specifies exactly when each job executes. According to this schedule, the amount of processor time allocated to every job 
is equal to its maximum execution time, and every job is completed by its deadline. During run time, the scheduler 
dispatches the jobs according to this schedule. Therefore, as long as no job ever overruns that is, some rare or condition 
with errors causes it to execute longer than its maximum execution time), all deadlines are surely met. Because the 
schedule is computed off-line, we can afford to use complex, sophisticated algorithms. Among all the feasible schedules 
(that is, those schedules where all jobs meet their deadlines), we may want to choose one which is good according to some 
criteria. (For example, the processor idles nearly periodically to accommodate aperiodic jobs). 

For example, we consider a system that contains four independent periodic tasks: 


T, = (4, 1), T 2 = (5, 1.8), T 3 = (20, 1), and T 4 = (20, 2). 

And their utilizations are 0.25, 0.36, 0.05, and 0.1, respectively. Hence, the total utilization is 0.76. 
Also, hyper-period, H - 20 (least common multiple of 4, 4, 20, 20) 

Ti starts execution at time 0, 4, 9.8, 13.8,.T 2 starts execution at 2, 8,12, 18,. 

Here all tasks meet their deadlines. 



T 2 

Ti | 

T ,4 

Tl 1 

lL 

T 2 1 

FI 

J 2 | 

liL 


0 4 8 12 16 20 

schedule. 


An arbitrary static 
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The scheduler can be implemented by storing the precomputed schedule as a table. In this table, each entry (t k , T(t k )) gives 
a decision time t k , that is an instant when a scheduling decision is made. T(t k ) is either the name of the task whose job 
should start at tk or I. 

Scheduler sets the hardware time to interrupt at the first decision time, t k = 0. 

On receipt of an interrupt at t k , the scheduler sets the timer interrupt to expire at t k+ i if previous task overrunning handle 
failure. 

If T(t k ) = I and aperiodic job waiting, start aperiodic job otherwise start next job in task T(tk) executing. 


k 

t k 

T(t) 

0 

0.0 

TI 

1 

1.0 

T3 

2 

2.0 

T2 

3 

3.8 

I 

4 

4.0 

TI 

5 

5.0 

I 

6 

6.0 

T4 

7 

8.0 

T2 

8 

9.8 

TI 

9 

10.3 

I 

10 

12.0 

T2 

11 

13.8 

TI 

12 

14.8 

I 

13 

17.0 

TI 

14 

17.0 

I 

15 

18.0 

T2 

16 

19.8 

I 


The stored table contains 17 entries. They are (0, Ti), (1, T 3 ), (2, T 2 ), (3.8,1), (4, Ti),...(19.8,1). Hence, the timer is set to 
expire at 0, 1, 2, 3.8, and so on. At these times, the scheduler schedules the execution of tasks Ti, T 3 , T 2 , and an aperiodic 
or background job, respectively. The table is used again during the next hyperperiod, and new decision times 20, 21, 22, 
23.8, and so on, can be obtained from the times in the fiAAAAAAAA* rst hyperperiod as described in the pseudo code 
below: 

Input: Stored schedule (t k , T(tk)) fork=0,l,...N -1. 

Task SCHEDULER: 

set the next decision point i and table entry k to 0; 

set the timer to expire at t k . 
do forever: 

accept timer interrupt; 

if an aperiodic job is executing, preempt the job; 

current task T = T(t k ); 
increment i by 1; 
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compute the next table entry k =i mod (N); 

set the timer to expire at [i/N]H + t k ; 
if the current task T is I, 

let the job at the head of the aperiodic job queue execute; 

else, let the task T execute; 

sleep; 

end SCHEDULER 

A periodic static schedule is called a cyclic schedule. This approach to scheduling hard real-time jobs is called the clock- 
driven or time-driven approach because each scheduling decision is made at a specific time. The decision is independent 
of events, such as job releases and completions in the system. And hence, a clock-driven system never exhibits the 
anomalous timing behavior of priority-driven systems. (Liu 86-87) 

General Structure of Cyclic Schedules Frames and major cycles 

Frame and major cycles 


ma J° r ..., i .i + l 

^ i ' 2 , 3 , 4 , - | 

frames TTI PI TTI 


t t+f t + 2f t + 3f t + Af t + H 

cyclic schedule 


General structure of; 


The above figure represents a good structure of a cyclic schedule. The scheduling decisions are made periodically 
according to the restrictions imposed by this schedule. The scheduling decision times partition the time line into intervals 
are called frames. Each frame has a length f (is frame size). There is no preemption within each frame because the 
scheduling decisions are made only at the beginning of each frame. The phase of each periodic task is a non-negative 
integer multiple of the frame size. In other words, the i-iAaAAAAAA* rst job of each task is released at the beginning of 
some frame. 


Frame size Constraints 


To avoid the preemption of the job, we make frame size f larger than the execution time ei of each task Ti. i.e. 


/ > max (e f ) 

l<i<« 

equation .... 1 

To minimize the length of the cyclic schedule, f should be chosen in such a way that it divides hyper-period, H. that is f 
dividesk the period pi of at least one task Ti. 


IPi/fl ~ Pi/f = 0 


equation.... 2 


To make it possible for the scheduler to determine whether every job completes by its deadline, the frame size should be 
small enough so that there is at least one frame between the release time and deadline of every job. 

2f-gcd(pi, f) < Di, for all I = 1, 2,..., n. 

All the frame size should be satisfied. 
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Job Slices 


Sometimes, a system cannot meet all the frame size simultaneously. We can often solve this by portioning a job with large 
execution time into slices called sub-jobs, with short execution time or deadlines. This gives the effect of preempting the 
large jobs, and hence, allow allow other jobs to run. 

Sometimes, it is necessary to partition jobs into more slices than required by the frame size to yield a feasible schedule 

Example: 

Consider a system with T, = (4, 1), T 2 = (5, 2, 7), T 3 = (20, 5). 

For Equation 1 to be true, we must have f >5, but to satisfy Equation 3 we must have f <4. 


iki IUiki l^iki ki 

0 4 8 12 16 20 


0 4 8 12 

16 20 

A cyclic schedule with frame size 2. 

|r,| t 2 |r„|r,| |r,| r, 

1 N T * 

1 k.l m 


0 4 8 

12 

16 20 

A preemptive cyclic schedule of Tl = (4,1), 


This can be solved by splitting T 3 into Tu = (20, 1), T 3 2 = (20, 3) and T 3 3 = (20, 1). 

The resultant system has HAAAAAAAA* ve tasks which can be scheduled with the frame size, f =4. The figure shows 
cyclic schedule for these tasks. The 3 original tasks are called Ti, T 2 and T 3 , respectively. And, the 3 subtasks of T 3 are 
called Tu, T 3 , 2 , andT 3 , 3 . 
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Clock Driven Scheduling 


[ Things To Remember | 


There are n periodic tasks in the system. 

The parameters of all periodic tasks are known a priori. 

Each job Ji,k is ready for execution at its release time ri,k 

A periodic static schedule is called a cyclic schedule. This approach to scheduling hard real-time jobs is called the clock- 
driven or time-driven approach. 

The scheduling decision times partition the time line into intervals are called frames. Each frame has a length f. 
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5.2 Cyclic Executives 


CYCLIC EXECUTIVES 

Cyclic executive approach is a way of modifying the cock driven scheduler to accommodate the restrictions that the 
scheduling decisions are made only at frame boundaries. 

Cyclic executive is used to mean a table-driven cyclic scheduler for all types of jobs in a multithreaded system. It makes 
scheduling decisions only at the beginning of each frame and deterministically interleaves the execution of periodic tasks. 
However, it also allows aperiodic and sporadic jobs to use the time that are not used by periodic tasks. 

The cyclic executive executes at each of the clock interrupts after taking over the processor. The clock interrupts occur at 
the beginning of frames. When it executes, the cyclic executive copies the table entry for the current frame into the current 
block. It then wakes up a job, called periodic task server, and lets the server execute the job slices in the current block. 
Upon the completion of the periodic task server, the cyclic executive wakes up the aperiodic jobs in the aperiodic job 
queue in turn and allows them to use the remaining time in the frame. Here, it is assumed that the cyclic executive wakes 
up and executes whenever a job/server completes. 

The cyclic executives also check for overruns at the beginning of the each frame and takes and appropriate way of 
handling these frame overruns. The two assumptions for cyclic executive are: 

the existence of timer and, 

each timer interrupt is handled by the cyclic executive within a bounded amount of delay 

IMPROVING THE RESPONSE TIME OF APERIODIC JOB. 

The sooner an aperiodic job completes, the more responsive the system is. For this reason, minimizing the response time 
of each aperiodic job or the average response time of all the aperiodic jobs is typically one of the design goals of real-time 
schedulers. 

Slack stealing 

Slack stealing was originally proposed for the priority driven systems. Slack stealing approach a way of improving the 
response time of aperiodic jobs by executing the aperiodic jobs ahead of the periodic jobs whenever possible. Each 
periodic job slice must be scheduled in a frame that ends no later than its deadline for the slack stealing approach to work. 

Let the total amount of time allocated to all the slices scheduled in the frame k be x k . The slack time available in the frame 
is equal to f — x k at the beginning of the frame. If the aperiodic job queue is nonempty at this time, the cyclic executive can 
let aperiodic jobs execute for this amount of time without causing any job to miss its deadline. When an aperiodic job 
executes ahead of slices of periodic tasks, it consumes the slack in the frame. After y units of slack time are used by 
aperiodic jobs, the available slack is reduced to f -x k -y. The cyclic executive can let aperiodic jobs execute in frame k as 
long as there is slack, i.e., the available slack f —Xk — y in the frame is larger than 0. 

When the cyclic executive HAAAAAAAA* nds the aperiodic job queue empty, it lets the periodic task server execute the 
next slice in the current block. The amount of slack remains the same during this execution. The cyclic executive returns to 
examine the aperiodic job queue after each slice completes as long as there is slack. (Liu 92-93) 
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slack stealing 


Example illustrating 


Execution of sporadic jobs (Acceptance Test) 

The sporadic jobs have hard deadlines, release time and execution time that are not known a priori. Therefore, a clock 
driven scheduler cannot guarantee a priori the sporadic jobs complete in time. However, the scheduler can determine 
whether a sporadic job is schedulable when it arrives using acceptance test. The acceptance test is performed to check 
whether the newly release sporadic job can be feasibly schedulable in the presence of all the jobs in the system at that 


If the time is sufficient, slack time in the frames before the deadline of new job, the new sporadic job is accepted, 
otherwise, it is rejected. If more than one sporadic jobs arrive at once then these should be queued for acceptance in EDF 
order. 

Consider the following example. 

5, (17,4.5) S 2 (29,4) 5 3 (22,1.5) S 4 (44,5.0) 

1 I 

I i 

0 3.5 4 7 8 10 12 15 16 19 20 



20 24 28 32 36 40 





40 44 48 52 56 60 


Example of scheduling 

sporadic jobs. 

EDF algorithm can be used to schedule the accepted sporadic jobs. For this, the scheduler maintains a queue of accepted 
sporadic job in non-decreasing order of their deadlines and inserts each newly accepted sporadic jobs into this queue in the 
same order. Whenever all the slices of sporadic task schedule in each frame are completed, the cyclic executive lets the job 
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in sporadic job queue execute in the order. The aperiodic jobs are allowed to accept only when accepted sporadic job 
queue is empty. 

Suppose, 

1. At time T = 3, sporadic job S1 with deadline 17 and execution time 4.5 is released. The acceptance test for this job is 
performed at time T = 4. Therefore, it must be scheduled in the frame 2, 3, and 4 which is less than the execution time of 
SI. Therefore, the scheduler rejects this jobs. 

2. At T = 5, job S2 with D = 29 and E = 4 is released. It can be executed in the frames 3 to 7. The acceptance test is 
performed at T = 8. The total amount of slack is 5.5 which is greater than the execution time E = 4. Therefore, the 
scheduler accepts this jobs and the first part of S2 with E = 2 executes in frame 3. 

3. At T = 14, job S3 with D = 22 and E = 2.5 releases. The acceptance test at T = 12 finds that 2 unit of slack is available in 
the frame 4 and 5 where S3 can be schedule because the deadline of S2 is later than S3. 

4. At T = 14, S4 with D = 44 and E = 5 is released. At T = 16, the acceptance test is performed. The slack time available is 
4.5 before D = 44. This slack time is less than the sporadic job S4 is rejected while the jobs S2 and S3 complete within 
their deadlines. 
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Cyclic Executives 



1. 1. To construct a cyclic schedule, we need to make three kinds of design decisions: 


<!— [if IsupportLists]—>a. <!—[endif]—>Choose a frame size based on constraints 

<!— [if !supportLists]->b. <!—[endif]->Partition jobs into slices 
<!— [if IsupportLists]—>c. <!—[endif]—>Place slices in frames 

1. 2. These decisions cannot be taken independently: 

<!— [if IsupportLists]->a. <!—[endif]—>Ideally want as few slices as possible, but may be forced to use more 

to get a feasible schedule 

1. Cyclic executive is used to modify table driven scheduler to be frame based. 
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5.3 Practical Considerations 


Practical Considerations 

1. Handling overruns: 

• Jobs are scheduled based on maximum execution time, but failures might cause overrun 

• A robust system will handle this by either: 1) killing the job and starting an error recovery task; or 2) preempting the job 
and scheduling the remainder as an aperiodic job 

• Depends on usefulness of late results, dependencies between jobs, etc. 

2. Mode changes: 

• A cyclic scheduler needs to know all parameters of real-time jobs a priori 

• Switching between modes of operation implies reconfiguring the scheduler and bringing in the code/data for the new jobs 

• This can take a long time: schedule the reconfiguration job as an aperiodic or sporadic task to ensure other deadlines met 
during mode change 

3. Multiple processors: 

• Can be handled, but off-line scheduling table generation more complex 

Algorithm for construction static schedule 

A system of independent preemptable periodic task whose relative deadline is equal to or greater than their respective 
periods is schedulable if and only if the total utilization of the task is not greater than 1. 

Some tasks may have relative deadlines shorter than their periods and the cyclic schedule is constrained to have structure 
and a feasible schedule may not exist event if the conditions are met. Therefore, an algorithm is used to find out a feasible 
cycle schedule if one exists. This algorithm is called iterative network flow algorithm of INF algorithm. 

It assumes that the task can be preempted at any time and are independent. In order to apply INF algorithm, find all the 
possible frame sizes of the system. For example, the possible frame sizes of the tasks Ti(4, 1), T 2 (5, 2, 7), T 3 (20, 5) are 2 
and 4. It satisfies the second and third conditions but not the first condition. In such cases, INF iterative finds a feasible 
cyclic schedule for a possible frame size at a time starting from largest frame size to the smallest frame size. 

The feasible schedule found in this way tells how to decompose some tasks into sub-tasks if the algorithm fails to find a 
feasible schedule. In this way, the given tasks do not have feasible cyclic schedule that satisfies all the framesize 
constraints. 

Network Flow Graph 

The network flow graph of preemptive scheduling jobs is used by INF. To use this algorithm, ignore the tasks to which 
the jobs belong and names the job to be scheduled in a major cycle of F frames as Jl, J2, J3,., Jn. 

The constraints on which the jobs can be scheduled are represented by the network flow graph of the system. This graph 
contains the following vertices and edges and the capacity at an edge is a non-negative number associated with the edge. 

1. A job vertex Ji represents each job. 

2. A frame vertex j represents each frame j in the major cycle. 

3. Contains two special vertices called source and sink. 

4. A directed edge (Ji, j) from a job vertex Ji to a frame vertex j. if the job Ji can be scheduled in the frame j and capacity of 
the edge is the frame size f. 

5. A directed edge from the source vertex to every jobs and the capacity of this edge is the execution time e s of the job. 

6. A directed edge from every vertex to the sink and the capacity of the edge is T. 

A flow of an edge is a non-negative number satisfying the following condition. 

1. No greater than the capacity of the edge. 

2. The sum of flows of all the edges into every vertex is equal to the sum of the flows of all the edges out of the vertex except 
source and sink. 

The above conditions can be shown in the following network flow graph. 
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Jobs 


Frames 



• Ji can be scheduled in frames x and y. 

• Jk can be scheduled in frames y and z. 

• (e k ),e k = “capacity of flow”. 

Pros and cons of clock driven scheduling 

There are many advantages of Clock driven scheduling. Some of them are listed below. 

1. Conceptual Simplicity 

• It defines the ability to consider complex dependencies, communication delays, and resource contention among jobs when 
constructing the static schedule, guaranteeing absence of deadlocks and unpredictable delays 

• The entire schedule is captured in a static table 

• Different operating modes can be represented by different tables. 

• No concurrency control or synchronization is required. 

• If completion time jitter requirements exist, can be captured in the schedule 

2. When workload is mostly periodic and the schedule is cyclic, timing constraints can be checked and enforced at 

each frame boundary 

3. The choice of frame size can minimize context switching and communication overheads 

4. Clock driven scheduling is relatively easy to validate, test and certify. 

Despite having its advantages, it has some disadvantages. These are listed below. 


1. Inflexible: 

• Pre-compilation of knowledge into scheduling tables means that if anything changes materially, have to redo the table 
generation 

• Best suited for systems which are rarely modified once built 


2. Other disadvantages: 

• Release times of all jobs must be fixed 

• All possible combinations of periodic tasks that can execute at the same time must be known a priori, so that the combined 
schedule can be pre-computed 

• The treatment of aperiodic jobs is very primitive 

• Unlikely to yield acceptable response times if a significant amount of soft realtime computation exists 
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Practical Considerations 


1. Practical considerations include handling overruns, mode changes and multiple processors. 

2. The main advantage of clock driven schedule is conceptual simplicity whereas inflexible is the disadvantage. 
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6.1 Priority Driven Scheduling of Periodic Tasks 


• R7!TH 

• Things To Remember _ 

Static Assumptions 

Assume a restricted periodic task model in a single processor with a fixed number independent periodic task. Also, assume 
that the jobs compressing through task are ready for execution as soon as they are released, can be preempted at any time 
and never suspend themselves. There are no aperiodic sporadic task and the new task are admitted only after the 
acceptance test. The scheduling decision are made immediately upon job release and completion. Algorithms are event 
driven and never intentionally leave a resource idle. The context switch overhead is negligible and the jobs may be 
rejected after acceptance test. 

Fixed versus Dynamic Priority Driven Algorithms 


A priority driven scheduling is an online scheduler in which the priority of each periodic task is fixed relative to other task. 
This scheduler doesn’t pre-compute a schedule of the tasks, but assigns priority to the jobs when they are released and 
places them on a run queue in priority order. When preemption is allowed, a scheduling decision is made whenever a job 
is released of completed. Each scheduling decision time, the scheduler updates the run queues and execute the jobs at the 
head of the queue. 

The priority driven scheduler are classified into two types based on how the priorities are assigned to the jobs. They are 
fixed priority and dynamic priority. 

All the jobs in a task may be assigned to the same priority called task level fixed priority or different priorities to each job 
in each task called task level fixed priority. 

Therefore, in the second case, the priority of task with respect to other task changes as they are released and completed. 

The priority of each job is usually fixed called job level fixed priority. In this, the priority of each job is assigned on its 
release when it is inserted into ready job queue. Therefore, once the priority is assigned, it doesn’t change. It means, at the 
level of individual jobs, the priorities are fixed even though the priorities are that task level are variable. Similarly, some 
system may vary the priority of a job after it has started execution and such schedules are called job level dynamic priority. 
The job level dynamic priority schedules are usually inefficient. The best example of fixed priority algorithms are rate 
monotonic algorithm and deadline monotonic algorithm. 

Rate Monotonic scheduling 

Rate monotonic algorithm is the best known example of fixed priority algorithm. This scheduling assigns priority to the 
task based on their periods. The shorter the period the higher the priority. The rate of jobs releases is the inverse of the 
period i.e. Rate = 1/period. Therefore, the jobs with higher rate have higher priority. For example, 


Consider the task T! = (4, 1) T 2 = (5, 2) T, = (20, 5). The rate of these task are R, = 1/4, R 2 = 1/5 and R, = 1/20 (i.e. R, = 
0.25, R 2 = 0.2, R 3 = 0.25). 

The rates are Ri>R 2 >R 3 and the priority is Ti>T 2 >T 3 . The timing diagram based on the rate monotonic scheduling is. 


|t,| t 2 |r 3 |r,| t 2 |7’ 3 |7’i|r 3 | t 2 |r,| t 3 \t 2 \t x \t 2 \ 

0 4 8 12 16 


k 


(4,1), T 2 = (5,2), and T 3 = (20,5) 


RM schedule of Ti = 


All the three task are in phase. Therefore, the first jobs Ju, J 2 ,i J 3 ,i of all the tasks are ready to execute at time t = 0. Since, 
according to RM algorithm the first task has highest priority and the third task has the lowest priority. The first job j u of 
first task Ti executes at t = 0. At t = 1, Ju completes and J 2 .i starts execution because it has higher priority than J 3 ,i. At t = 3, 
Ji,istarts execution but at t = 4, the second Ju of task Ti is ready to execute. Therefore, Jupreempts the job J 3 ,i. This process 
continues until t = 20 which is hyperperiod of the given task. This sequence of execution is repeated after t = 20. 

The rate monotonic algorithm can be shown in the table as shown below. 
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Time 

0 

1 

3 

4 

5 

7 

8 

9 

10 
12 
13 

15 

16 

17 

18 

19 

20 


Ready to run 

Running 

J„, Jz„ J3.1 

J,., 

Jv , Jb, 

Jb., 

Jb, 

Jb., 

J 3 „ J 1,2 

J 1.2 

J2.2, Jb., 

J2.2 

Jb., 

Jb., 

J i.3, Jb., 

J 1,3 

J2.3, Jb., 

Jb., 

J 2 . 3 , Jb,, 

J2.3 

Jm, Jb., 

J,4 

Jb, 

Jb, 

J2.4 

J2.4 

J,5, J 2,4 

Ju 

J2.4 

J2.4 


Repeat in the above 
pattern 


Deadline Monotonic Scheduling 

DM is a fixed priority algorithm which assigns priority to the tasks according to the relative deadlines. The shorter the 
relative deadline the higher priority. Therefore, the tasks with the shortest relative deadline is executed first and the longest 
relative deadline is executed last. 


Consider the following tasks. 

T, = (50, 50, 25, 100) 

T 2 = (0, 62.5, 10, 20) 

T 3 = (0, 125, 25, 50) 

The timeline for these tasks according to DM algorithm is shown below: 
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T \ 


T \ 

0 

missed 

deadline 

■ I 

0 125 f 250 

missed 
deadline 

When the relative deadline of every task matches period, then the rate monotonic algorithm and deadline monotonic 
algorithm gives identical result but when the relative deadline are arbitrary, then the deadline monotonic can sometimes 
produces a feasible schedule whereas RM may not. The RM algorithm fails when the DM algorithm is preferred rather 
than RM if the deadline is identical to the period. 

References 

Liu, Jane W. S. Real Time Systems. Integre Technical Publishing Co., Inc, January 10, 2000. Print. 
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Priority Driven Scheduling of Periodic Tasks 

• Note 


1. When tasks are independent, the scheduler can delete any task and add an acceptance task at any time without causing any 
missed deadline. 

2. The scheduler accepts and adds the new task to the system only if the new task and all other existing tasks can be feasibly 
scheduled, otherwise, the scheduler rejects the new task. 

3. A priority-driven scheduler is an on-line scheduler which assigns priorities to jobs after they are released and places the 
jobs in a ready job queue in priority order according to some priority-driven algorithm. 

4. Types of priority driven algorithms: fixed priority algorithm and dynamic priority algorithm. 

5. A fixed-priority algorithm assigns the same priority to all the jobs in each task. 

6. A dynamic-priority algorithm assigns different priorities to the individual jobs in each task. 

7. Rate-Monotonic Algorithm assigns priorities to tasks based on their period: the shorter the period, the higher the priority. 

8. Deadline-Monotonic Algorithm assigns priorities to tasks according their relative deadlines: the shorter the relative 
deadline, the higher the priority. 
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6.2 Maximum Schedulable Utilization 


Maximum Schedulable Utilization 

Any system is schedulable by an algorithm if the algorithm always produces a feasible schedule of the system. It is 
possible to schedule (and feasible) only if the feasible schedule of the system exists. Maximum total utilization means the 
maximum total utilization of the system in order for the system to be surely schedulable. 

Schedulable Utilization of EDF Algorithm 

We know that a periodic task Ti is denoted by 4-tuple (91, Pi, c,, Di) with utilization Ui = c,/P,. 

The total utilization of the system 

U = Y.?=i ui, where 0<=U<=1 

A scheduling algorithm can feasibly schedule any system of periodic tasks on a processor if U is equal to or less than the 
maximum schedulable utilization of the algorithm (Ualg) and if Ualg= 1, then the algorithm is optimal. 

Ualg id important because it gives a schedulability test where a system can be validated by showing that U <= Ualg. 

Theorem: 

A system of independent, preemptable periodic tasks with relative deadlines equal to their respective periods (i.e. Di=Pi) 
can be feasibly scheduled on one processor if and only if its total utilization is equal to or less than 1 (U <= 1). 

Proof: 

Consider that the relative deadline of every task is equal to its period, then according to the theorem any such system can 
be feasibly scheduled if its total utilization is equal to or less than 1, regardless of number of tasks. On the other hand, if 
any system fails to meet some deadlines, then obviously the total utilization is greater than 1. 

Based on above theorem and the discussion, we can say that the periodic tasks with Di = Pi. 

The corollary of above theorem states that, the result also holds true if the relative deadline is longer than the period. That 
means UraTcr independent preemptable periodic tasks with Di >= Pi. 

Note: above result is independent of Pi and the result can also be applied to strict LST. 

Schedulability Test for the EDF Algorithm 

A test that is used to validate that the given systems can indeed meet all its hard deadlines when scheduled according to a 
chosen scheduling algorithm is called a schedulability test. If this test is efficient, then it is can be used as an on-line 
acceptance test. 

Schedulability test is used to check whether a set of periodic tasks meet all their deadlines. 

Suppose, 

1. The periodic (Pi), execution time (&), and their relative deadline (Di) of every task Ti in a system T = {Ti, T 2 ,. . . ., T„} and 

2. A priority driven algorithm used to schedule the tasks in T preemptively on one processor are given. 

In the above given conditions if the phases are also given and if the values of all the given parameters do not vary, the 
problem can be solved by simulation. For this, we can construct a segment of schedule (time line) of these tasks according 
to given algorithm. The length of the segment will be equal to 2H + maxi Ph- maxi Di. If we do not find any missed 
deadlines in this segment, then Ti will certainly meet its deadlines. But, this method doesn’t work if the parameters vary. 
To determine whether the given system of n independent periodic tasks surely meet all the deadlines when scheduled 
according to preemptive EDF algorithm on one processor, check the following inequality condition called schedulability 
condition. 


n 


E 


e k 

min (Da, p k ) 


< 1 
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If this condition is satisfied, then the system is scheduled according to EDF algorithm. But if this condition is not satisfied, 
then the following two conclusions can be made which depends on the relative deadline (Di). 

1. If D k >=Pk for all k from 1 to n, eq. i becomes U<=1, which is both a necessary and sufficient condition for a system to be 
feasible. 

2. If D k <Pk for some k, eq. i is only a sufficient condition and so the system may not be schedulable when the condition is not 
satisfied. 

If the actual inter-release times are sometimes longer than P k , the total demand for processor time will be smaller. 
Therefore, if according to eq. i the system is schedulable on one EDF basis, it remains schedulable when the inter-release 
times of jobs in some tasks are sometimes longer than the respective periods of the tasks. 

Optimality of RM and DM 

RM and DM algorithms assigns fixed priorities to the tasks. So, they can’t be optimal. They may fail to schedule some 
systems, even though feasible schedule exists. For example, consider two tasks Ti = (2, 1) and T 2 = (5, 2.5). 

Total utilization (U) = 1/2 + 2.5/5 = 1 

Since, U = 1, the task are feasible. 

Here, Ju and Im can complete in time only if they have a higher priority than J 2 ,i. It means between t = 0 to 4, it must have 
a higher priority than T 2 . But at time 4 when Ji, 3 release, J 2 .ican complete in time only if T 2 (or J 2 ,i) has a higher priority than 
Ti (or J u ). This change in the relative priorities of the tasks is not allowed by any fixed priority algorithm. 

Any fixed priority algorithm can be optimal in restricted system. RM (DM) algorithm is not optimal for tasks with 
arbitrary periods. But it is optimal in a special case when the periodic tasks in the system or simply periodic and deadlines 
of the tasks are not less than their respective periods. 

A system of periodic tasks is said to be simply periodic if the period of each is an integer multiple of the periods of the 
other tasks. That is, 

Pk = nPi 

Where, P<Pk and n is a positive integer for all Ti and T k . 

The following theorem, states that the RM algorithm is optimal for scheduling a system having above condition. 

Theorem for optimality of RM Algorithm: 

A system of simply periodic independent, preemptable tasks whose relative deadlines are equal to larger than their periods 
(i.e. Di>=R) is schedulable on one processor according to RM algorithm if and only if its total utilization is equal to or less 
than 1. 

Proof: 

Assume that in a simple periodic system, all the tasks are in phase (worst case execution time occurs when tasks are in 
phase). Also, consider that the processor never remains idle before the task Ti misses a deadline at t for the first time, 
where t is an integer multiple of Pi. 

Since, the system is simply periodic, t is also an integer multiple of periods (P k ) of all higher priority tasks (T k ) for k = 1,2, 
.i-1. Therefore, the total time required to complete jobs with deadlines before and at t is 

to 

of the highest priority task. It means, T misses a deadline at t means that the demand for time exceeds t. in other words, 
RM fails when Ui>l. 

Theorem for Optimality of DM Algorithm. 

Generally, fixed priority scheduling is not optimal. But, it gives more predictable and stable system. Among the fixed 
priority algorithms, DM algorithm is the best and optimal. 

Theorem: 

A system T of independent preemptable tasks that are in phase and have D i <=P i can be feasibly scheduled on one 
processor according to DM algorithm whenever it can be feasibly scheduled according to any fixed priority algorithm. 

Proof: 

This theorem is true because we can always transform a feasible fixed priority schedule which is not DM into DM 
schedule. 

For example, consider a system having feasible fixed priority schedule which is not a DM schedule. 
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Now, arrange all the tasks starting from Ti with the shortest Di in order of increasing Di. If there are two tasks Ti and 
T i+ i with Di<D i+ i and priority of T w higher than Ti. In such case, we can switch the priorities of these two tasks and modify 
the schedulable of two tasks that are assigned on the DM basis relative to one another. By repeating this process, the given 
schedule can be transformed into DM schedule. 

When Di of every task Ti is equal to dPi for some constant 0<d where d is density, then the DM and RM algorithms are 
same. In this case, the corollary of above theorem can be stated as “the RM algorithm is optimal among all fixed priority 
algorithms whenever the relative deadlines of the tasks are proportional to their periods.” 

References 

Liu, Jane W. S. Real Time Systems. Integre Technical Publishing Co., Inc, January 10, 2000. Print. 


Maximum Schedulable Utilization 


• Note 


1. The Earliest-Deadline-First (EDF) algorithm assigns priorities to individual jobs in the tasks according to their absolute 
deadlines. 

2. The Least Slack Time (LST) scheduler checks the slacks of all the ready jobs each time a new job is released and orders 
the new job and the existing jobs on the basis of their slacks: the smaller the slack, the higher the priority. 

3. We call a test for the purpose of validating that the given application system can indeed meet all its hard deadlines when 
scheduled according to the chosen scheduling algorithm a schedulability test. 

4. We can use the schedulability condition of the EDF algorithm as a rule to guide the choices of the periods and execution 
times of the tasks while we design the system. 
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6.3 Schedulability of Fixed Priority Tasks with Short Response Time 


Schedulability of fixed-priority tasks with short response time 

The fixed priority algorithms are predictable and do not suffer from scheduling anomalies. The worst case execution time 
of the system occurs with the worst case execution time of the jobs, unlike dynamic priority algorithm which can exhibit 
anomalous behavior. Pseudo-polynomial time schedulability test can be used for the tasks scheduled according to the fix 
priority algorithms. In this, the response times of the jobs should be smaller than or equal to their respective periods and 
the total utilization should be less than or equal to 1. It means, every job completes execution before the next job if the 
same task is released. 

The above fact can be used for general schedulability test of fixed-priority tasks with short response time. The methods 
used for the test are, 

1. Find the critical instant when the system is most loaded and has its worst response time. 

2. Use time demand analysis to determine whether the system is schedulable at that instant. 

3. Prove that if a fixed priority system is schedulable at the critical instant then it is always schedulable. 

Time Demand Analysis 

The time demand analysis is used to determine the system behavior at the critical instant. For each job J , c released at a 
critical instant if J«and all higher priority tasks complete executing before their Di, then the system can be scheduled. To 
prove this we can compute the total demand for processor time by a job released at a critical instant and all the higher 
priority tasks as a function of time from the critical instant. A test is conducted to check whether this demand can be met 
before the deadline. 

Consider one task Ti at a time starting from highest priority and working down to lowest priority. Consider a job Tin 
T where the release time are 0 of that job is a critical instant of T. At time to + t for t>=0, the processor time demand wTor 
this job and all the higher priority jobs released between hand t 

wi = ei+Xtii(-^)ek 

Where, c, = execution time of job 

£fc=i (^) e k = execcution time of higher priority jobs 


Compare the time demand Wi(t) with the available time t then 

1. If Wi(t) <=t, then the job T meets its deadline. 

2. If Wi(t)>t, for all 0<=t<=Di then the task may not complete by its deadline and the system may not be scheduled using a 
fixed priority algorithm. 

The above two cases are sufficient conditions but not necessary condition. We can use this method to check that all the 
tasks can be scheduled if they are released at their critical instant and we can conclude that the entire system can be 
scheduled. 

Consider three tasks Ti = (3, 1), T 2 = (5, 2) and T 3 = (10, 2) scheduled according to rate monotonic algorithm. The total 
utilization U = 0.933. For these tasks, we can draw a graph for time demand function as shown below: 
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Time (t) 


The time demand function is a staircase function. The steps in the time demand for a task occurs at multiples of the period 
for higher priority task. The value of Wi(t) and t linearly decreases from a step until the next step. In the figure, the time 
demand function Wi(t), w 2 (t) and w 3 (t) are not above the timeline t at their deadlines. Therefore, all the tasks in the system 
meet their deadlines and can be feasibly scheduled. 

Critical Instant 

A critical instant of a task Ti is a time instant in which 

1. The job in T released at this instant has the maximum response time of every job in Tiis equal to or less than the relative 
deadline Diof T and 

2. The response time of the job released at the instant is greater that Di if the response time of some jobs in T exceeds Di. 
Therefore, a critical instant for a job is the worst case release time for that job taking into account all jobs that have higher 
priority. It means, the job released at the same instant as all jobs that have higher priority are released and must wait for all 
those to complete before it executes. The response time of a job in Tireleased at a critical instant is called the maximum 
(possible) response time and is denoted by Wi. 

The schedulability test involves checks each task, in turn, to verify that it can be scheduled when started at a critical 
instant. If it is schedulable at all critical instants, then it will work at other times as well. 

Now, we can say that a critical instant of a task T is a time instant such that: 

If Wi,k<=Di. k for every T k in T.then the job released at that instant has the maximum response time of all jobs in Tiand 
W,=W,, k . 

Else if, 


3Ji*:Wi*>Dik 

Then the job released at that instant has response time > D where W,, k is the response time of job. 

THEOREM 
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In a fiAAAAA* xed-priority system where every job completes before the next job in the same task is released, a critical 
instant of any task Ti occurs when one of its job J 1C is released at the same time with a job in every higher-priority task, 
that is, ri, c =r k ,li for some lk for every k =l,2,...,i -1. 

Schedulability Test for Fixed Priority Tasks with Arbitrary Response 
Time 

Busy Intervals 

Consider a term level-7ti busy interval. A level-Tii busy interval (to,t] begins at the instant to when 

1. All the jobs in Ti released before the instant have completed and 

2. A job in Ti is released. 

The interval ends at the first instant t after to when all the jobs in Ti released since tO are completed. It means, in the 
interval [tO, t], the processor is busy all the time executing jobs with priorities TtiOr higher. All the jobs executed in the 
busy interval are released in the interval and at the end of the interval, there is no backlog of jobs to be executed 
afterwards. Therefore, while computing the response time of jobs in T„ we can consider every level-7ti busy intervals. 
Consider the schedule of three tasks TI = (2, 1), T2 = (3, 1.25) and T3 = (5, 0.25) 

Using the RM algorithm, the timing diagram is as follows: 



o 2 4 6 8 to 12 



7j = (2, 1), T 2 = (3, 1.25), T 3 = (5, 0.25) 


Example illustrating busy interval 


General Schedulability Test 

The general schedulability test is described below. It relies on the fact that when determining the schedulability of a task 
T in a system in which the response times of jobs can be larger than their respective periods, it still suffices to 
confiAAAAA* ne our attention to the special case where the tasks are in phase. However, the first job Jy may no longer 
have the largest response time among all jobs in Ti. Consequently, we must examine all the jobs of Ti that are executed in 
the first level-7ii busy interval. If the response times of all these jobs are no greater than the relative deadline of Ti, Ti is 
schedulable; else, T s may not be schedulable. 

General Time-Demand Analysis Method 

Test one task at a time starting from the highest priority task Ti in the order of decreasing priority. To determine whether a 
task T is schedulable, let us assume that all the tasks are in phase and the first level-7ti busy interval begins at time 0. 

While testing whether all the jobs in T s can meet their deadlines (that is, whether Ti is schedulable), consider the subset 
Ti of tasks with priorities 7ti or higher. 

1. If the first job of every task in Ti completes by the end of the fiAAAAA* rst period of the task, check whether the first job 
T i in Ti meets its deadline or not. Ti is schedulable if J u completes in time. Else, Ti is not schedulable. 

2. If the fiAAAAA* rst job of some task in Ti does not complete by the end of the first period of the task, do the following: 

3. Compute the length of the in phase level-7ti busy interval by solving the equation 
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t=EU(^P 


iteratively, starting from 

t<MUrt 

until t H+1) = t">. The solution t (1> is the length of the level-7ti busy interval. 

1. Compute the maximum response times of all ft <0 /pi] jobs of Ti in the in=phase level- 7ti busy intervals in the manner 
described below and determine whether they complete in time. 

Ti is schedulable if all these jobs complete in time; else, Ti is not schedulable.(Liu 141-142) 

Sufficient Schedulability Condition for RM and DM algorithm Schedulable Utilization of RM Algorithm when Di = P 

Theorem 

A system of n independent, preemptable periodic tasks with relative deadlines equal to their respective periods can be 
feasibly scheduled on a processor according to the RM algorithm if its total utilization U is less than or equal to 

U™(n) =n(2 1,n —1) 

UmM o s ■ . 


n UrmO 1 ) as a function n. 

Example, 

T, = (1.0, 0.25), T 2 = (1.25, 0.1), T 3 = (1.5, 0.3), T 4 = (1.75, 0.07), and T s = (2.0, 0.1). 
Utilizations are: 0.25, 0.08, 0.2, 0.04, and 0.05 

Total utilization = U = 0.62 
Urm (5) = 0.743 


Practical Factors 

It includes the following: 

1. Non-preemptable 

2. Self-suspension 

3. Context switch 

4. Tick scheduling 
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Schedulability of Fixed Priority Tasks with Short Response Time 


1. Critical Instant: The response time of a job Ji* in T has the maximum possible value if the k-th period, which begins when 
J,, k is released, is a peak frame and this peak frame begins at the same time as a peak frame in every high-priority task. 

2. To determine whether a task can meet all its deadlines: 


<!— [if IsupportLists]-- >— <!—[endif]—>Compute the total demand for processor time by a job released at a 

critical instant of the task and by all the higher-priority tasks as a function of time from the critical instant. 
<!- [if IsupportLists] —>— <!—[endif]—>Check whether this demand can be met before the deadline of the job. 

3. Busy Intervals: A level-busy interval (to, t] begins at an instant to when 
All jobs in T released before the instant have completed. 

- <!—[endif]— >A job in T is released. 

4. Practical Factors: Non-preemptable, Self-suspension, Context switch, Tick scheduling 


For more CSIT notes by ASCOL teachers and students ping: http://csitascolhelp.bloqspot.com/ 




7.1 Priority Driven Scheduling of Aperiodic and Sporadic Tasks 


Priority Driven Scheduling of Aperiodic and Sporadic Tasks 

Assumptions and approaches 

1. Single processors system with independent preemptable periodic task, which are scheduled using a priority driven 
algorithm:- (i) The parameters of all periodic tasks are known, (ii) In the absence of aperiodic and sporadic jobs, the 
periodic jobs meet deadlines. 

2. Aperiodic and sporadic jobs may exist and they are independent of each other and of the periodic task. They can be 
preempted at any time. 

3. The parameters of sporadic jobs become known after they are released. 

4. Some sporadic jobs may not be possible to meet their deadlines. 

Therefore, either reject the sporadic jobs that cannot complete in time or accept all sporadic jobs and allow some of them 
to complete late. 

Objective, Correctness of Optimality 

The operating system maintains priority queue of periodic, aperiodic and sporadic jobs. Acceptance jobs are placed into 
priority queue and each type of job is queued separately with queuing discipline as shown in the figure of system model 
below. 



system. 


Priority queues maintained by the operating 


The scheduler selects the job at the head of the queue and provides to the processor in order to execute it. 

The scheduling problems associated with the execution of a sporadic jobs is related with a determination of when 
aperiodic and sporadic jobs are executed. Based on the execution time and deadline of each newly arrived sporadic job, we 
can decide whether to accept or reject the job. Accepting the job means that the job will complete within the deadline 
without causing any aperiodic task or other sporadic task to missed their deadline. 

The problem is also related with how to do acceptance test and how to schedule accepted jobs. For this, the aperiodic jobs 
are always accepted and executed as soon as possible without causing periodic jobs and accepted sporadic jobs to miss 
their deadlines. 


Correctness 


The correct schedule is one where all periodic tasks and any sporadic tasks that have been accepted do not miss deadlines. 
A scheduling algorithm supporting aperiodic and sporadic jobs is a correct algorithm if it only produces correct schedules 
for the system. 
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Optimality 

If the queuing discipline used to order aperiodic jobs is given then an aperiodic job scheduling algorithm is optimal if it 
minimizes either. 


1. The response time of the job at the of the aperiodic job queue or 

2. The average response time of all aperiodic jobs for a given queuing discipline. 

A sporadic job scheduling algorithm is optimal if it accepts a new sporadic job and schedules that job to complete y its 
deadline if and only if the new job can be correctly scheduled to complete in time. An optimal algorithm always produces 
a feasible schedule for the accepted jobs. 

Deferrable Server 


A deferrable server is the simplest of bandwidth-preserving servers. The deferrable server can be described based on 
consumption and replenishment rules that define the deferrable server (P„ e,). 

The consumption rule is, 


• The execution budget of the server is consumed at the rate of one per unit time whenever the server executes. 
The replenishment rule is, 


The execution budget of the server is set to e s at time instances kp t for k = 0, 1, 2,. 

In the deferrable server, the budget is not allowed to cumulate from period to period. It means the budget is held by the 
server immediately before each replenishment time is lost. 


Consider, a deferrable server Tds (3, 1), Ti = (2, 3.5, 1.5) and T> = (6.5, 0.5). These tasks 
Also, consider that aperiodic job A occur at t = 2.8 and its execution time is 1.7. 

r »r < 3 -»I_[£□_tl_L 

0 3 6 9 

tx 2 - 3 - 5 -‘- 5 »tn_ r~i h r~i _t 


scheduled rate monotonically. 


r 2 =(6.5,0.5)|Q 

o 


Budget 

i.o -- 

0.5 - 


_L_Q 

6.5 



3 6 9 


deferrable server 


the operations of; 


1. At time 0, the server is given 1 unit of budget. The budget stays at 1 until time 2.8. When A arrives, the deferrable server 
executes the job. Its budget decreases as it executes. 

2. Immediately before the replenishment time 3.0, its budget is equal to 0.8. This 0.8 unit is lost at time 3.0, but the server 
acquires a new unit of budget. Hence, the server continues to execute. 

3. At time 4.0, its budget is exhausted. The server is suspended, and the aperiodic job A waits. 

4. At time 6.0, its budget replenished, the server resumes to execute A. 

5. At time 6.5, job A completes. The server still has 0.5 unit of budget. Since no aperiodic job waits in the queue, the server 
suspends itself holding this budget 
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Sporadic Server 

Definition: 

T h is the subset of periodic tasks with higher priorities than the server. 
t r is the lost time the servet budget replenished, 
tf is the first instant after tr at which the server begins to execute 
te is the last effective replenishment time 

1. BEGIN as the start of earliest busy interval in the most recent contiguous sequence of busy intervals of T H starting before t. 

2. END as the end of the last busy interval in this sequence if this interval ends before t and define END = cc if the interval 
ends after t. 

Sporadic Server in Fixed-Priority Systems 

The rules includes the following. 

Consumption rule: 

At any time t after t r , if the server has budget and if either of the following two conditions is true, the server budget is 
consumed at the rate of one per unit time. 

Condition 1 (Cl): the server is executive. 

Condition 2 (C2): the server has executed. 

Replenishment Rule: 

There are three rules: 

R1: When the system begins executing and each time budget is replenished, set the budget to e, and t, = current time. 

R2: When the server begins to execute, 

if END = tf, then 

t, = max(t,, BEGIN) 

else if END < t f then t = t f 

the next replenishment time is set to t, + P,. 

R3: The next replenishment occurs at the next replenishment time (te + Ps) except under the following conditions 

1. If te + P. is earlier than t f , the budget is replenished as soon as it is exhausted. 

2. If T becomes idle before t e + P s and becomes busy again at t b , the budget is replenished at min(t„ + p„ t b ). 

Consider, the budget of the server (5,1.5) is 1.5 initially. It is scheduled rate-monotonically with three periodic tasks: T1 = 
(3, 0.5), T2 = (4, 1.0), and T3 = (19, 4.5). They are schedulable even when the aperiodic job queue is busy all the time. 
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the operations of a simple sporadic server 

1. From time 0 to 3, the aperiodic job queue is empty and the server is suspended. Since it has not executed, its budget stays 
at 1.5. At time 3, the aperiodic job Ai with execution time 1.0 arrives; the server becomes ready. Since the higher-priority 
task (3, 0.5) has a job ready for execution, the server and the aperiodic job wait. 

2. The server does not begin to execute until time 3.5. At the time, t, is 0, BEGIN is equal to 3, and END is equal to 3.5. 
According to rule R2, the effective replenishment time t is equal to max(0, 3.0) = 3, and the next replenishment time is set 
at 8. 

3. The server executes until time 4; while it executes, its budget decreases with time. 

4. At time 4, the server is preempted by T 2 . While it is preempted, it holds on to its budget. 

5. After the server resumes execution at 5, its budget is consumed until exhaustion because it executes (Cl) and then, when it 
is suspended again, T1 and T2 are idle (or equivalently, END, which is 5.0, is less than the current time) (C2). 

6. When the aperiodic job A 2 arrives at time 7, the budget of the server is exhausted; the job waits in the queue. 

7. At time 8, its budget replenished (R3), the server is ready for execution again. 

8. At time 9.5, the server begins to execute for the time since 8. t„ is equal to the latest replenishment time 8. Hence the next 
replenishment time is 13. The server executes until its budget is exhausted at 11; it is suspended and waits for the next 
replenishment time. In the meantime, A 2 waits in the queue. 

9. Its budget replenished at time 13, the server is again scheduled and begins to execute at time 13.5. This time, the next 
replenishment time is set at 18. However at 13.5, the periodic task system T becomes idle. Rather than 18, the budget is 
replenished at 15, when a new busy interval of T begins, according to rule R3b. 

10. The behavior of the later segment also obeys the above-stated rules. In particular, rule R3b allows the server budget to be 
replenished at 19. (Liu 205-207) 
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Priority Driven Scheduling of Aperiodic and Sporadic Tasks 



1. The budget of a deferrable server (p s ,es) is set to es at time instants kp„ for k = 0,1,2,.... The server consumes 
only when it executes. 

2. A simple sporadic server (p s ,e,) emulates aperiodic task of period p, and execution time 


budget 
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7.2 Other Bandwidth Preserving Services 


Other Bandwidth Preserving Services 

There are three other bandwidth preserving server algorithms that give a simple way to schedule aperiodic jobs in a 
deadline driven system. They are 

1. Constant Utilization Server 

2. Total Bandwidth server 

3. Weighted Fair Queuing Server 

All these algorithms are approximations of an ideal generalized processor sharing (GPS) algorithm. The objective is to 
assign a portion of available processor time to a task making it believe that it was execution on a slower processor 
independent of any other task. It also provides fair schedule, timing isolation or guaranteed server in each round a very 
small amount of time slice of length proportional to server size. 


Constant Utilization Server 

The server that is used to schedule aperiodic jobs along with periodic tasks in a deadline driven system is called constant 
utilization server. A constant utilization server reserves a non-fiction u,’ (instantaneous utilization) of the processor time 
for execution of the server. Like other bandwidth preserving server, it has a budget and is defined in terms of consumption 
and replenishment rules. 

When the budget is non-zero, the same server is scheduled with tasks on an EDF basis. The budget and deadline of the 
server are chosen such that the utilization of the server is constant when it executes and it is always given enough budget 
to complete ate the head of the queue each time its budget is replenished. The server never has budget if it has no work to 
do. 

Consumption Rule: 

A constant utilization server only consumes budget only when it executes. 

Replenishment Rule: 

R1: Initially, budget e, = 0 and deadline d = 0 

R2: When an aperiodic job with execution time e arrives at time t to an empty aperiodic job queue, 

1. If t<d, do nothing 

2. If t>=d, then set d = t+(e/u,’) and c s =e 
R3: At the deadline d of the servers, 

1. If the server is backlogged, set d = d + (e/u s ’) and e s = e. 

2. If the server is idle, do nothing. 

Consider, Ti + (4, 0.5), T 2 = (4, 1), T 3 = (19, 4.5) and u,’ = 0.25. Similarly, the aperiodic jobs A1 = (3, 1), A2 = (6.9, 2) and 
A3 = (15.5, 2). 

The timing diagram for the execution of these tasks is below: 
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Total Bandwidth Server 

The constant utilization server gives a non-fraction of processor capacity to a task, but it cannot claim unused capacity to 
complete the task earlier. The total bandwidth server improves the responsiveness by permitting a server to claim 
background time not used by the periodic task. This is achieved by having the scheduler replenish the server budget as 
soon as the budget is exhausted if the server is backlogged or as soon as the server becomes backlogged. For, a given set 
aperiodic jobs and server size, both total bandwidth server and constant utilization server have the same sequence of 
deadlines but the budget of a total bandwidth server may be replenished earlier than that of a constant utilization server. 

Consumption rule: 

A total bandwidth server only consumes budget when it executes. 

Replenishment rule: 

R1: Initially, e, = 0 and d = 0 

R2: When an aperiodic job executes arrive, at time t to an empty periodic job queue, set di=max (d, t) + c/u,’ and c, = e. 
R3: When the server completes current aperiodic the job is removed from the queue and 

1. If the server is backlogged, set d = d + e/u,’ and e s = e 

2. If the server is idle, do nothing 

Weighted Fair Queuing Server 

The objective of constant utilization server and total bandwidth server is to assign some fraction of processor capacity to c 
task. When assigning the capacity there is the issue of fairness. A scheduling algorithm is said to be fair within any 
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particular time interval of the fraction of processor time in the interval obtained by each backlogged server is proportional 
to the serve size. In such cases, not only all the tasks met their deadline, but they all make continual progress according to 
their share of the processor and no starvation occurs. 
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Other Bandwidth Preserving Services 


1. The server that is used to schedule aperiodic jobs along with periodic tasks in a deadline driven system is called constant 
utilization server. 

2. The total bandwidth server improves the responsiveness by permitting a server to claim background time not used by the 
periodic task. 
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7.3 Slack Stealing in Deadline Driven System 


Slack stealing in deadline driven system 

Consider, a system of two periodic tasks, T1 = (2.0, 3.5, 1.5) and T2 = (6.5, 0.5). In addition to the aperiodic job that has 
execution time 1.7 and is released at 2.8, suppose another aperiodic job with execution time 2.5 is released at time 5.5. 
These jobs are A1 and A2, respectively. The figure below shows the operation of a slack stealer. 

t < h. . nl. . i 1 1 i _L 

2 4 6 8 10 

^ b . 

0 


slack _ | | | ^2 | ^ 

stealer 


2.5 

a 2 


Example: slack stealer in a 


1. Initially, the slack stealer is suspended because the aperiodic job queue is empty. When Aiarrives at 2.8, the slack stealer 
resumes. Because the execution of the last 0.7 units of Ju can be postponed until time 4.8 (i.e., 5.5-0.7) and T 2 has no 
ready job at the time, the system has 2 units of slack. The slack stealer is given the highest priority. It preempts J u and 
starts to execute Ai. As it executes, the slack of the system is consumed at the rate of 1 per unit time. 

2. At time 4.5, Ai The slack stealer is suspended. The job Ju resumes and executes to completion on time. 

3. At time 5.5, A 2 arrives, and the slack stealer becomes ready again. At this time, the execution of the second job Ji, 2 of 

Ti can be postponed until time 7.5, and the second job J 2 , 2 of T 2 can be postponed until 12.5.Hence, the system as a whole 
has 2.0 units of slack. The slack stealer has the highest priority starting from this time. It executes A 2 . 

4. At time 7.5, all the slack consumed, the slack stealer is given the lowest priority. Ji, 2 preempts the slack stealer and starts to 
execute. 

5. At time 9, Ju completes, and the system again has slack. The slack stealer now has the highest priority. It continues to 
execute A 2 . 

6. When A 2 completes, the slack stealer is suspended again. For as long as there is no job in the aperiodic job queue, the 
periodic tasks execute on the EDF basis. 

In principle, slack stealing in a priority-driven system is almost as straightforward as in a clock-driven system as shown by 
the figure. The key step in slack stealing is the computation to determine whether the system has any slack or not. While 
this is a simple step in a clock-driven system, it is considerably more complex in a priority-driven system. The remainder 
of this section and most of the following section are devoted to algorithms for computing the amount of available slack. 
These algorithms are called slack computation algorithms. 

A slack computation algorithm is correct if it never says that the system has slack when the system does not, since doing 
so may cause a periodic job to complete too late. An optimal slack computation algorithm gives the exact amount of slack 
the system has at the time of the computation. Therefore, it is correct. A correct slack computation algorithm that is not 
optimal gives a lower bound to the available slack. (Liu 233-235) 
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Slack Stealing in Fixed priority System 


In principle, slack stealing in a fixed-priority system works in the same way as slack stealing in a deadline-driven system. 
However, the computation and the usage of the slack are both more complicated in fixed-priority systems. 

Consider, the system contains three periodic tasks Ti = (3, 1), T 2 = (4, 1), and T 3 = (6, 1). 
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The example points out the following: 



No slack-stealing algorithm can minimize the response time of every aperiodic job in a fixed-priority system even when 
prior knowledge on the arrival times and execution times of aperiodic jobs is available. 

The amount of slack a fixed-priority system has in a time interval may depend on when the slack is used. To minimize the 
response time of an aperiodic job, the decision on when to schedule the job must take into account the execution time of 


the job. (Liu 244) 

Scheduling of Sporadic Jobs 

Recall the sporadic job scheduling problem: - 

Based on the execution time and deadline of each newly arrived sporadic job, decide whether to accept or reject the job 
Accepting the job implies that the job will complete within its deadline, without causing any periodic task or previously 
accepted sporadic job to miss its deadline 

Do not accept a sporadic job if cannot guarantee it will meet its deadline 

Model for Scheduling Sporadic Jobs 

When sporadic jobs arrive, they are both accepted and scheduled in EDF order 
In a dynamic-priority system, this is the natural order of execution 
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In a fixed-priority system, the sporadic jobs are executed by a bandwidth preserving server, which performs an acceptance 

test and runs the sporadic jobs in EDF order 

In both cases, no new scheduling algorithm is required 

Definitions: - 

Sporadic jobs are denoted by Si(r,, d, a) where r, is the release time, d, is the (absolute) deadline, and c, is the maximum 
execution time 

The density of a sporadic job Ai =ei/(d -r.) • The total density of a system of n jobs is A =Ai+A 2 + ... +A„ 

The job is active during its feasible interval (ri,d ] 

Sporadic Jobs in Dynamic priority System 

Theorem: 

A system of independent preemptable sporadic jobs is schedulable according to the EDF algorithm if the total density of 
all active jobs in the system < 1 at all times 

This is the standard schedulability test for EDF systems, but including both periodic and sporadic jobs 

This test uses the density since deadlines may not equal periods; hence it is a sufficient test, but not a necessary test 

This means: 

If we can bound the frequency with which sporadic jobs appear to the running system, we can guarantee that none are 
missed 

Alternatively, when a sporadic job arrives, if we deduce that the total density would exceed 1 in its feasible interval, we 
reject the sporadic job (admission control) 

Real time performance of jobs with soft timing constraints 

Traffic Models 

Specifically, when requesting admission into the system, each sporadic task presents to the scheduler its traffic parameters. 
And the collection of these parameters are called the flow specification of the task in communications literature. They 
define the constraints on the inter-arrival times and execution times of jobs in the task. The performance guarantee 
provided by the system to each task is conditional which means that the system delivers the guaranteed performance 
conditioned on the fact that the task meets the constraints defiAAAAA* ned by its traffic parameters. The flip side is that 
if the task misbehaves (that is, its actual parameters deviate from its traffic parameters), the performance experienced by it 
may deteriorate. Different traffiAAAAA* c models use different traffic parameters to specify the behavior of a sporadic 
tasks. 

Leaky bucket model 

We first introduce the notion of a (A, E) leaky bucket filter, then define the leaky bucket model. This kind of fillter is 
specified by its input rate A and size E: The filter can hold at most E tokens at any time and it is filled with tokens at a 
constant input rate of A tokens per unit time. A token is lost if it arrives at the filter if the filter already contains E tokens. 

It can be assumed of sporadic task that meets the (A, E) leaky bucket constraint as if its jobs were generated by the filter in 
the following manner. The filter may release a job with execution time e when it has at least e tokens. After the release of 
this job, the e tokens are removed from the filter. A job cannot be released when the filter has no token. Therefore, not any 
job in a sporadic task that satisfies the (A, E) leaky bucket constraint, has execution time larger than E. The total execution 
time of all the jobs which are released within any time interval of length E/A is less than 2E. A periodic task with period 
equal to or larger than E/ A and execution time equal to or less than E satisfies the (A, E) leaky bucket constraint. A 
sporadic task S = (1, p, p’, I) that fits the FeVe model satisfiAAAAA• es the constraint if p’ = 1/ A and E = (1— p/p’)I/p’. 
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Slack Stealing in Deadline Driven System 


The key step in slack stealing is the computation to determine whether the system has any slack or not. 

A slack computation algorithm is correct if it never says that the system has slack when the system does not, since doing 
so may cause a periodic job to complete too late. 

To minimize the response time of an aperiodic job, the decision on when to schedule the job must take into account the 
execution time of the job. 

Sporadic jobs are denoted by Si(ri, d, a) where r, is the release time, d is the (absolute) deadline, and c, is the maximum 
execution time 
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8.1 Resources and Resource Access Control 


Resources and Resource Access Control Assumption 

1. The system contain only one processor 

2. The system contains three types of serially reusable resources named Ri, R 3 ,.R„ 

3. Serially reusable resources are allocated to jobs on a non-preemptive basis and used in a mutually exclusive manner. 

4. Some resources can be used by more than one job at the same time. 

Enforcement of Mutual Exclusion and Critical Sections 

0 2 4 6 8 iO 12 14 16 18 

J x . URJ L(R 2 ) . U(R 2 ) ... U(R 3 ) . 


J 2 ... L(R 2 ) ... L(R 3 )..U(R 3 )..U(R 2 ) L(R 3 ) ... U(R 3 ) 


J 3 . L(R X ) ... L(R f 3) ... L(R r 4)... U(R p ,4)...U(R r 3)..U(R x ) 

Examples of 

critical sections 

The lock based concurrency control mechanism is used to enforce mutually exclusive access of jobs to the resources. 
When only jobs wants to use ip units of R resources, it executes a lock to request the resource. This request is represented 
by L(Ri, r|). When the job is granted the resource, it continues to execute and when it does not need the resource, the 
resource is released by executing unlock U(Ri, q ). 

A segment of a job that begins at a lock and ends with at a matching unlock is called a critical section. The resources are 
released in the last in first out (FIFO) order and it helps to properly nest the overlapping critical sections. A critical section 
that is not included in other critical section that is not included in other sections is called outermost critical section. When 
the resource being used has number of units and the time period of execution, then it is represented as [R, q, ej. 

Resource Control and Blocking 

The resource conflict of the jobs conflict with one another occurs when the resource requirement of the jobs is same. It 
means the resource conflict occurs when two jobs require the same resource at the same time. The job contend for a 
resource when a job request a resource that is being used by other jobs. It means a job request for a lock cannot be granted 
by the scheduler. The lock request fails and the job is blocked and it loses the processor. The blocked job is removed from 
the ready job queue. It can be unblocked only when the required resource becomes free and granted to the job. 

Consider the following: 

J, = (6, 14) 
h = (2, 17) 

J 3 = (0, 18) 

These jobs are scheduled on the processor on EDF basis. The priority is Ji>J2>J3. All these jobs require a resource R that 
has only one unit. Assume critical section (Ri, 2), (R, 4) and (R, 4). 
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Effects of resource contention and Resource Access Control 


A resource access control protocol or access control protocol is a set of rules that govern: 

1. When and under what conditions each request for resource is granted. 

2. How the jobs requiring the resources are scheduled. 

There are three types of effects of resource contention 

1. Priority Inversion 

2. Timing Anomalies 

3. Deadlock 

Priority Inversion 

Priority inversion can occur when the execution of some jobs or portions of jobs is non-preemptable. Resource contentions 
among jobs can also cause priority inversion. Because resources are allocated to jobs on a non-preemptive basis, a higher- 
priority job can be blocked by a lower-priority job if the jobs conflict, even when the execution of both jobs is 
preemptable. In the figure A, the lowest priority job Tfirst blocks T and then blocks Ji while it holds the resource R. As a 
result, priority inversion occurs in intervals (4, 6] and (8, 9], 




fig B - Example illustrating 


When priority inversion occurs, timing anomalies follow. Figure B gives an example. The three jobs are the same as those 
shown in A, except that the critical section in J3 is [R; 2.5]. It can also be said that the execution time of the critical section 
in J 3 is shortened by 1.5. As a consequence of this reduction in J3’s execution time, all jobs should complete sooner. This 
reduction allows jobs J 2 and J 3 to complete faster. But, rather than meeting its deadline at 14, T misses its deadline because 
it does not complete until 14.5. 

Without good resource access control, the duration of a priority inversion can be unbounded which is illustrated in 
example below (figure C). Here, jobs Ji and J 3 have the highest priority and lowest priority, respectively. 
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fig C - Uncontrolled priority 


1. At time 0, h becomes ready and executes. It acquires the resource R shortly afterwards and continues to execute. 

2. After R is allocated to J 3 , Ji becomes ready. It preempts J 3 and executes until it requests resource R at time 3. 

3. Because the resource is in use, Ji becomes blocked, and a priority inversion begins. 

4. While J 3 is holding the resource and executes, a job J 2 with a priority higher than J 3 but lower than Ji is released. 

5. Moreover, J 2 does not require the resource R. This job preempts J 3 and executes to completion. Thus, J 2 lengthens the 
duration of this priority inversion. In this situation, the priority inversion is said to be uncontrolled. 

There can be an arbitrary number of jobs with priorities lower than J i and higher than I 3 released in the meantime. They can 
further increase the length of the duration of the priority inversion. When priority inversion is uncontrolled, a job can be 
blocked for a long time even to infinity. 

Non-preemptivity of resource allocation can also lead to deadlocks. For example, there are two jobs and both require 
resources X and Y. The jobs are in deadlock when one of them holds X and requests for Y, while the other holds Y and 
requests for X. The conditions that allow this circular wait of jobs for each other (that is, a deadlock) to occur are well- 
known. 

These examples illustrates that no resource access-control protocol can eliminate the priority inversion and anomalous 
behavior caused by resource contention. A more realistic goal of such a protocol is that it keeps the delays thus incurred as 
short as possible. For this reason, a criterion we use to measure the performance of a resource access-control protocol is 
the blocking time of each job. A good resource access control protocol should control priority inversion and prevent 
deadlock and, hence, keep the blocking time of every job bounded from the above. (Liu 280-282) 

Non Preemptive Critical Section 

The critical sections of the jobs can be scheduled non-preemptivity on a processor. It means, when a job request for 
resource it is always granted the resource and when the job holds the resource, it is executed with highest priority than 
other jobs. This protocol is called non-preemptive critical section protocol (NPCS). This protocol helps to avoid the 
deadlock because jobs are never preempted. 

Basic Priority Inheritance Protocol 

The priority-inheritance protocol is a simple protocol. It works with any preemptive, priority-driven scheduling algorithm. 
Like the NPCS protocol, it does not require prior knowledge on resource requirements of jobs. The priority-inheritance 
protocol does not prevent deadlock. When there is no deadlock (that is, when some other method is used to prevent 
deadlock), the protocol ensures that no job is ever blocked for an indefiAAA* nitely long time because uncontrolled 
priority inversion cannot occur. 

Definition of basic priority inheritance protocol 

The basic priority inheritance protocol is defined by the following rules: 

Scheduling Rule: 

Ready jobs are scheduled on the processor preemptively in a priority driven manner according to their current priorities. 

At its release time t, the current priority 7t(t) of every job J is equal to its assigned priority. The job remains at this priority 
except under the condition stated in rule 3. 

Allocation Rule: 
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When a job J requests a resource R at time t, 

if R is free, R is allocated to J until J releases the resource, and 
if R is not free, the request is denied and J is blocked. 

Priority Inheritance Rule: 

When the requesting job J becomes blocked, the job Ji which blocks J inherits the current priority 7t(t) of J. The job 
Ji executes at its inherited priority 7i(t) until it releases R; at that time, the priority of Ji returns to its priority 7Ci(t) at the ti 
t when it acquires the resource R. (Liu 286) 

Consider the following jobs: 
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Schedule under priority 

inheritance 

Properties of Priority Inheritance Protocol 

1. Direct Blocking - low priority job blocks the high priority jobs as the resource is being used by low priority jobs. 

2. Priority Inheritance Blocking - priority of high priority job is inherited i.e. transfer to low priority job to blocking jobs. 

3. Transitive Blocking - priority of job is transferred two times and block. 

Short coming of priority inheritance protocol 

1. Doesn’t prevent deadlock 

2. Doesn’t reduce the blocking time suffered by jobs as small as possible 
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Resources and Resource Access Control 



1. A resource that has an infinite number of units has no effect on the timing behavior of any job since every job can have 
the resource at any time; there is no need to include the resource in our model. Therefore, we lose no generality by 
assuming that every resource Ri has a finite number of units. 


2. we call a segment of a job that begins at a lock and ends at a matching unlock a critical section. Furthermore, resources 
are released in the lastin-first-out order. Hence overlapping critical sections are properly nested. 

3. It means the resource conflict occurs when two jobs require the same resource at the same time. 

4. A resource access control protocol or access control protocol is a set of rules that govern (i. When and under what 
conditions each request for resource is granted, ii. How the jobs requiring the resources are scheduled.) 

5. Types of effects of resource contention (Priority Inversion, Timing Anomalies, Deadlock) 

6. The priority-inheritance protocol does not prevent deadlock. 
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8.2 Basic Priority Ceiling Protocol 


Basic Priority Ceiling Protocol 

The priority-ceiling protocol extends the priority-inheritance protocol to prevent deadlocks and to further reduce the 
blocking time. This protocol makes two key assumptions: 

1. The assigned priorities of all jobs are fixed. 

2. The resources required by all jobs are known a priori before the execution of any job begins 

3. 

Definition of the Basic Priority-Ceiling Protocol 

It can be defined by the following rules: 

Scheduling Rule: 

3. At its release time t, the current priority 7i(t) of every job J is equal to its assigned priority. The job remains at this 

priority except under the condition stated in rule 3. 

4. Every ready job J is scheduled preemptively and in a priority-driven manner at its current priority 7i(t). 

Allocation Rule: 

Whenever a job J requests a resource R at time t, one of the following two conditions occurs: 

1. R is held by another job. J’s request fails and J becomes blocked. 

2. R is free. 

3. If J’s priority 3i(t) is higher than the current priority ceiling 7t’(t), R is allocated to J. 

4. If J’s priority 7i(t) is not higher than the ceiling ji’(t) of the system, R is allocated to J only if J is the job holding the 
resource(s) whose priority ceiling is equal to 7t’(t); otherwise, J’s request is denied, and J becomes blocked. 
Priority-Inheritance Rule: 

When J becomes blocked, the job Ji which blocks J inherits the current priority 7t(t) of J. Jiexecutes at its inherited priority 
until the time when it releases every resource whose priority ceiling is equal to or higher than 7i(t); at that time, the priority 
of Ji returns to its priority jti(t) at the time t when it was granted the resource(s). 


Example below shows the schedule of the system of jobs. 
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ceiling protocol 


A schedule illustrating priority- 


Deadlock Avoidance by Priority-Ceiling Protocol 

One of the way to avoid deadlock is to use the ordered resource technique i.e. the priority ceiling of the resources imposes 
a linear order on all the resources. 


Consider three jobs Ji, J 2 and T with priorities 1, 2 and 3 respectively. The release times are 3.5, 1 and o and their critical 
sections are oritiesl, 2, and 3, respectively. Their release times are 3.5, 1, and 0 and their critical sections are [Dotted; 1.5], 
[Black; 2[Shaded; 0.7]], and [Shaded; 4.2 [Black; 2.3]], respectively. The timing diagram for these jobs is shown below: 

0 2 4 6 8 10 12 14 

I I I I I I I I 



JZL 


ri(Dotted) = jcj, U(Black) = Yl(Shaded) = n 2 


Example illustrating 
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how priority-ceilling protocol prevents deadlock 
The executions of jobs can be described as follows: 

1. At time t = 0, only T 3 is released and therefore, it starts execution. When J 3 requests shaded at time t = 0.5, no resource is 
allocated at that time and therefore J 3 ’s request is granted. When job T becomes ready at time t = 1, it preempts J 3 . 

2. At time t = 2.5, I 2 request black. Because shaded is already allocated to J 3 and has priority ceiling of the system 2, which is 
equal to the priority of J 2 . According to (ii) of part (b) of rule 2, J 2 is denied block even though the resource is free. Since, 
J 2 is blocked, J 3 inherits the priority 2, resumes and starts to execute. 

3. When J 3 request black at time t = 3, it is holding the resource whose priority ceiling is the current ceiling of the system. 
According to (ii) of part (b) of rule 2, J 3 is granted the resource black and it continues to execute. 

4. J 3 is preempted again at time t = 3.5 when T becomes ready. When Ji request dotted at time t = 4.5, the resource is free and 
the priority of Ji is higher than the ceiling of system (i) of part (b) of rule 2 applies and the dotted resource is allocated to 
Ji which complete at 7.2 and the process continues. 


Stack Based Priority Ceiling (Ceiling Priority) Protocol 

Stack-based priority ceiling protocol is simpler than the priority ceiling protocol but has the same worst-case performance. 
Two different definitions of stack based protocols can be obtained based on two different motivations. 

1. To provide stack sharing capability 

2. To simplify the priority ceiling protocol 

Motivation and Definition of Stack Sharing Priority Ceiling Protocol 

One of the resource in the system is the runtime stack. Each job may have its own runtime stack, but where the number of 
jobs is large. It may be necessary to share a common runtime stack in order to reduce the overall memory demand. The 
space in the stack is allocated to the jobs contiguously in the last in first out (LIFO) manner. Therefore, when a job 
executes, its stack space is on the top of the stack. The space is freed only when the job completes. When the job J is 
preempted , the preempting job has the stack space above J. J can resume execution only after all the jobs holding the stack 
space leave J’s stack space on the top of the stack. 

In this protocol, deadline can occur. The causes of deadline can be: 

1. When a high priority job preempts the low priority job and occupies the top of the stack space. 

2. When the low priority job has already been using the resource that is required by the high priority job. 

To ensure the deadlock-free sharing of the runtime stack among the jobs, no job should ever blocked because it is denied 
some resources once it begins execution. This condition leads to the modified version of priority ceiling protocol called 
stack based priority ceiling protocol. This protocol allows to share the runtime stack among the jobs if they never self¬ 
suspend. 

The rules defining the basic stack-based priority ceiling protocol are 
0 rule - Update of the current rule 

Whenever all the resources are free, the ceiling of the system is £2. The ceiling 7t’(t) is updated each time a resource is 
allocated or freed. 

Scheduling Rule: 

After a job is released, it is blocked from starting execution until its assigned priority is higher than the current ceiling 7i’(t) 
of the system. At all times, jobs that are not blocked are scheduled on the processor in a priority-driven, preemptive 
manner according to their assigned priorities. 

Allocation Rule: 

Whenever a job requests a resource, it is allocated the resource. 

Consider the following jobs 
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Schedule illustrating the 

stack-based, priority-ceiling protocol 

Definition of ceiling Priority Protocol 

The worst case performance of the stack based and basic priority ceiling protocols are the same. But the stack-based 
protocol is simpler and has a lower context switch overhead. Due to these reasons, the stack based protocols are preferred 
if the job never self-suspend. When the resource requirement of the jobs are known, the jobs holding any resource are 
allowed to execute at the highest priority of all the jobs requiring the resource. This is called ceiling priority protocol. 

The rules defined by this protocol are 
Scheduling protocol 

1. Every job executes at its assigned priority when it does not hold any resource. Jobs of the same priority are scheduled on 
the FIFO basis. 

2. The priority of each job holding any resource is equal to the highest of the priority ceilings of all resources held by the j ob. 
Allocation Rule: 

Whenever a job requests a resource, it is allocated the resource. 

Use of Priority Ceiling Protocol in Dynamic Priority Systems 

In a dynamic priority systems, the priorities of periodic task change with the time, while the resources required by each 
task remain constant. As a result, the priority ceilings of the resources may change with time. For some dynamic systems, 
we can use the priority ceiling protocol to control resources accesses provided by the priority ceiling of each resource and 
the ceiling of the system is updated each time the priority change. 

Implementation of Priority Ceiling Protocol in Dynamic Priority System 

The priority ceiling protocol can be used in job level fixed priority system by updating the priority ceiling of all the 
resources whenever a new job is released. When a new job is released, its priority relative to all the jobs in the ready queue 
is assigned according to the given dynamic priority algorithm. After this, priority ceilings of all the resources are updated 
based on the new priorities of the task and the ceiling of the system is updated based on the new priority ceiling is used 
until it is updated again when the next job releases. This protocol is considered effective because it prevents deadlock and 
transitive blocking and no job is ever blocked for longer than the length of one critical section in a job level fixed priority 
system. 

Consider the following three tasks. 

T, = (0.5, 2.0, 0.2; [Black; 0.2]), 
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T 2 = (3.0, 1.5; [Shaded; 0.7]), and 
T, = (5.0,1.2; [Black; 1.0 [Shaded; 0.4]) 

The priority ceilings of the two resources black and shaded are updated at 0, 0.5, 2.5, 3, 4.5, 5, 6 and so on. The priorities 
of the ready jobs are indicated by positive integers with one (1) having highest priority. The following timing diagram 
illustrates the use of basic priority ceiling protocol in an EDF system of above jobs. 
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Example illustrating the 

use of the basic priority-ceiling protocol in an EDF system 
The timing diagram can be described as follows: 

1. At time t = 0, only two jobs J 2 .i and J 3 ,i are released. Since, the deadline of J 2 is earlier than the deadline of J w , i.e. 3 & 5. 
Therefore, Ji has priority 1 & T has 2. The priority of the resources black and shaded are 2 & 1 respectively. Therefore, job 
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J 2 .i starts execution and since no resources has been used in the beginning, the priority ceiling of the system is omega (£1). 
At time t = 0.3, J 2 .i acquires resources shaded and the ceiling of the system moves from omega to 1. 

2. At time t = 0.5, the first job J u of Ti is released and it has the highest priority i.e. 1. The priorities of J21 & J3.1 are 2 & 3 

respectively. Since, J2.1 requires the resource black, the ceiling of the resource also becomes 1 and the ceiling of the shaded 
becomes 2. Therefore, the resource black is granted to the high priority job Jm and it completes by the time t = 0.7. After 
that the job J2.1 continues to execute and the ceiling of the system falls back to 2 which is equal to the priority ceiling of 
resource shaded. 

3. At time t = 2.5, Ji, 2 is released and its priority is 1 because it has the earliest deadline, the priority of the ready job J 3 ,i is 2. 

This update of the task priorities have no change in priority ceiling of the resource. Since the ceiling of the system at t = 
2.5 is 1, job J 1,2 becomes blocked. It can be the resource block at t = 2.9. 

4. At time t = 3, Ti & T 2 have the jobs ready for execution and the priorities are 1 & 2 respectively. Therefore, the priority 
ceiling of the system remains unchanged until the next release of 4.5. 

5. At time t = 4.5, job Ju is released and its deadline is 6.5 which is later than the J22i.e. 6. Therefore, the priority of 

J2,2 becomes 1 and Ju becomes 2. This change in the priorities of task causes the change in the priority ceiling of black and 
shaded to 2 & 1. 

6. At time t = 5, J 32 is released and starts execution because there are no other ready jobs. The priority ceilings of both the 
resources arc 1 which remains same until t = 6. 

7. At time t = 6, both J 23 and J 32 are ready & J 23 has earlier deadline. Therefore, the same cycle starting from t = 0 is repeated. 

References 

Liu, Jane W. S. Real Time Systems. Integre Technical Publishing Co., Inc, January 10, 2000. Print. 


Basic Priority Ceiling Protocol 


1. The priority-ceiling protocol extends the priority-inheritance protocol to prevent deadlocks and to further reduce the 
blocking time. 

2. The different definitions of stack based protocols can be obtained based on two different motivations (To provide stack 
sharing capability, To simplify the priority ceiling protocol). 

3. According to the ceiling priority protocol, which is identical to the stack-based version of the priority-ceiling protocol, 
each job executes at its assigned priority when it does not hold any resource, and at the highest of priority ceilings of all 
resources held by the job whenever the job holds any resource. 

4. When the resource requirement of the jobs are known, the jo 

5. bs holding any resource are allowed to execute at the highest priority of all the jobs requiring the resource. This is called 
ceiling priority protocol. 
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8.3 Preemption Ceiling Protocol 


Preemption Ceiling Protocol 

Preemption Levels of Jobs and Periodic Tasks 

The parameter that is used to indicate the possibility that a job J, will preempt another job is called preemption level and 
represented by \|/. The preemption levels of jobs are functions of their priorities and released times. According to a valid 
preemption level assignment, for every pair of jobs T and jt, the preemption level \p of Ji being equal to or higher than 
preemption level \p of T means that it is never possible for J k to preempt T. That is validating condition: if tc, is higher than 
Tit & n > r k then \|/ s is higher than \|/ k . 

This can be illustrated with the help of the following example having jobs Jl, J2, J3, J4 & J5 with release times 
r 4 <r 5 <r 3 <ri<r2. 



Definitions of Protocols and Duration of Blocking 

A preemption-ceiling protocol makes decisions on whether to grant a free resource to any job based on the preemption 
level of the job in a war similar to the priority-ceiling protocol. This protocol also assumes that the resource requirements 
of all the jobs are known a priori. After assigning preemption levels to all the jobs, the preemption ceiling of each resource 
is determined. Specifically, when there is only 1 unit of each resource, that is assumed is the case here, the preemption 
ceiling \|/(R) of a resource R is the highest preemption level of all the jobs which require the resource. For the example in 
the figure above (i.e a schedule according to the preemption -ceiling protocol) the preemption ceiling of Black is 1, while 
the preemption ceiling of Shaded is 2. 

The ceiling of the system \|/'(t) at any time t is the highest preemption ceiling of all the resources which are in use at t. 
When the context is clear and there is no chance of confusion, we will simply refer to \|/'(t) as the ceiling of the system. f> 
is used to denote a preemption level that is lower than the lowest preemption level among all jobs since there is no 
possibility of confusion. When all the resources are free, it is said that the ceiling of the system is Q. (Liu 310-311) 

Rules of Basic Preemption-Ceiling Protocol 

1 and 3 


The scheduling rule (i.e., rule 1) and priority inheritance rule (i.e., rule 3) are the same as the corresponding rules of the 
priority-ceiling protocol. 

Allocation Rule: 


Whenever a job J requests resource R at time t, one of the following two conditions occurs: 

1. R is held by another job. J’s request fails, and J becomes blocked. 

2. R is free, (a) If J’s preemption level \|/(t) is higher than the current preemption ceiling \|/'(t) of the system, R is allocated to 
J. (b)If J’s preemption level \|/(t) is not higher than the ceiling \|/'(t) of the system, R is allocated to J only if J is the j ob 
holding the resource(s) whose preemption ceiling is equal to \|/(t); otherwise, J’s request is denied, and J becomes blocked. 
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Controlling Access to multiple Unit Resource 

We describe an extension to priority ceiling and preemption ceiling protocol so that they can deal with the general case 
where there may be more than 1 unit of each resource type. 

Priority (Preemption) Ceilings of Multiple Unit Resource 

The first step in extending the priority-ceiling protocol is to modify the definition of the priority ceilings of resources. We 
let n(Ri> k), for k < w, denote the priority ceiling of a resource Ri when k out of the v, (> 1) units of Ri are free. If one or 
more jobs in the system require more than k units of Ri, n( Ri ,k) is the highest priority of all these jobs. If no job requires 
more than k units of Ri, n( Ri ,k) is equal to flthe non-existing lowest priority. In this notation, the priority ceiling n( R i) of 
a resource Rj that has only 1 unit is n( R j>0). 

Let k(t) denote the number of units of Ri that are free at time t. Because this number changes with time, the priority ceiling 
of R changes with time. The (current) priority ceiling of the system at time t is equal to the highest priority ceiling of all 
the resources at the time.(Liu 313) 

For example. 
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Example of priority 


Controlling Concurrent Accesses to Data objects 

Data objects are a special type of shared resources. When jobs are scheduled preemptively, their accesses to (that is, reads 
& writes) data objects may be interleaved. For ensuring data integrity, it is common to require that the reads and writes be 
serializable. A sequence of reads and writes by a set of jobs is serializable if the effect produced by the sequence on all the 
data objects shared by the jobs is the same as the effect produced by a serial sequence (that is, the sequence of reads and 
writes when the jobs execute according to a non-preemptive schedule).(Liu 317-318) 

Convex-Ceiling Protocol 

The resource access-control protocols described in earlier sections do not ensure serializability. For eg., both the NPCS 
and PC (Priority- and Preemption-Ceiling) protocols allow a higher-priority job L to read and write a data object X 
between two disjoint critical sections of a lower-priority job Ji during which Ji also reads and writes X. The value of X thus 
produced may not be the same as the value produced by either of the two possible serial sequences (that is, all the reads 
and writes of Ji either proceed or follow that of L). 

Motivation and Assumptions 

Two Phase Locking (2PL) is a well-known way of ensuring serializability. According to this protocol, a job never requests 
any lock once it releases some lock. Thus, the critical sections of L and J 3 in Figure below: 
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Satisfy this protocol, but the critical sections of J 2 do not. Under the 2PL protocol, J 2 would have to hold the locks on 
R 2 and R 3 until time 16. (This is because the critical sections are also required to be properly nested.) We can easily get 
concurrency-control protocols that not only ensure serializability but also prevent deadlock and transitive blocking by 
augmenting the protocols described in earlier sections with the two-phase locking rule. As a result, we have the NPCS- 
2PL and the PCP2PL protocols. The augmented protocols have an obvious shortcoming: prolonged blocking. Following 
the 2PL rule, a job may hold a data object even when it no longer require the object. As a result, it may block other jobs 
for a longer duration. 

References 
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Preemption Ceiling Protocol 


1. The parameter that is used to indicate the possibility that a job Ji will preempt another job is called preemption level and 
represented by \p. 

2. The preemption levels of jobs are functions of their priorities and released times. 

3. Preemption level protocol also assumes that the resource requirements of all the jobs are known a priori. 

4. A sequence of reads and writes by a set of jobs is serializable if the effect produced by the sequence on all the data objects 
shared by the jobs is the same as the effect produced by a serial sequence. 

5. ccording to convex ceiling protocol, a job never requests any lock once it releases some lock. 
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9.1 Model of Multiprocessor and Distributed Systems 


• I3WB_ 

Model of multiprocessor and distributed systems 

The systems contains more than one processors. Some systems are known as multiprocessor systems while the others are 
known as distributed systems. A multiprocessor system is tightly coupled so that global status and workload information 
on all processors can be kept current at a low cost. The system uses a centralized dispatcher or scheduler. When each 
processor has its own scheduler, the actions and the decisions of the scheduler of all processors are coherent. In contrast, a 
distributed system is loosely coupled in such a system that it is costly to keep global status and workload information 
current. The schedulers on the different processors may make scheduling and resource access control decisions 
independently. As a result, their decisions may be incoherent as a whole. 

It is assumed that each processor has its own scheduler in this chapter. Each scheduling, resource access control or 
synchronization algorithm will be evaluated to see how much the algorithm relies on the current global information, how 
much coordination among schedulers is required and therefore how suitable the algorithm is for loosely coupled systems. 

Identical versus Heterogeneous Processors 

It is said that the processors are of the same type or identical, if the processors can be used interchangeably. For example, 
in a parallel machine, each of the CPUs can execute every computation job in the system, so the CPUs are identical. If any 
message from a source to a destination can be sent on any of the data links connecting them, then the links are identical. 

Different types of processors cannot be used interchangeably. Different types of processors may have different functions. 
As an example, CPUs, file disk, and transmission links are functionally different. So, they cannot be used interchangeably. 
Processors can be of different types for many reasons. For example, if the designer decides to use some CPUs for only 
some components of the system but not others, then the CPUs are divided into different types according to the components 
that can execute them. In a static system, the application system is partitioned into p components and jobs in each 
component execute on a fixed CPU. CPUs are viewed as p different processors. 

The model of heterogeneous processors used here is known as the unrelated processors model in scheduling theory 
literature. According to this model, each job can execute on some types of processors but, in general, not all types. 

Different types of processors may have different speeds. The execution times of each job on different types of processors 
are unrelated hence the name of the model. For example, the execution time of a computation intensive job is 1 second in 
CPU1 but is 5 seconds on a less powerful CPU2. Because CPU2 has better interrupt handling and I/O capabilities, the 
execution time of an I/O intensive job is 10 seconds on CPU1 but is only 3 seconds in CPU2. Both jobs cannot execute on 
a transmission link and signal processor, so their execution time on these kinds of processors are infinite. The unrelated 
models allows us to characterize all system. 

Processor is denoted by the letter P. the system contains p types of processors. There are a total of mi processors of type i 
i.e P for i = 1, 2, 3,.. ., m. (Liu, 2003, pp. 345-346) 

End-to-End Jobs and Tasks 

A task in real time of a monitor system consists of three jobs: sampling, encoding, and processing the reading of a sensor 
on a field processors; sending the sensor data by a communication processor to the central control processor; and 
correlating and displaying the data with other sensor data on the control processor. 

Job shops and flow shops 

Concurrency in a multiprocessor system arises as jobs of different tasks sequencing through different processors in a 
pipeline manner. The classical jobs shop and flow shop model captures this types of concurrency. According to the job 
shop model, each task Ti is a chain of n(i) jobs denoted by Ji kfor k = 1,2, 3,. . . n(i) for all l<k<n(i). The adjacent jobs 
Jut and Ji,k+i on the chain execute on different processors Ji,k+i become ready for execution only when LkCompletes. For 
example, the real-time monitor task is a chain of three jobs. 

We can specify the processors on which n(i) jobs in each task Ti execute by the visit sequence Vi - ( vu , V w , ■ ■ ■, Vm®) of 
the task. The k ,h entry V i k in this sequence gives the name of the processors on which the job Ji, k executes. Therefore, the 
visit sequence of the real time monitor task is field processor, communication processor, and control processor. 

For example, visit sequence Vi = (P=, P 2 ), V 2 = (P 2 , P 3 , P4, P3, Pi) of tasks Ti and T 2 in a system means that Ti has two jobs 
Ju on Pi followed by Ji, 2 on P 2 . T 2 has five jobs and they execute in turn on P 2 then on P 3 , P 4 , Pi and finally on Pi. 
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A flow shop is a special job shop in which all the task have the same visit sequence. For example, in a real-time monitor 
system, each task samples, processes and displays, the readings of a different sensors. Therefore, the visit sequence of all 
the monitor task are V = (field processor, communication processor, control processor). If the system has only one 
processor of each type. 

End to End Timing Constraints 

The timing constraints that can be derived directly from the high-level requirement of the applications are typically end to 
end in nature. They give the release time and deadline of each task as a whole. In this, the release time r, of the task T in a 
job shop is the release time of the first job J u and the deadline Di of the task is the deadline of its last job T„ because the 
timing constraint of such task are improved on the jobs at the two ends of the task, they are called end to end release time 
and deadline is called end to end task. 

Periodic End to End Task 

An End to end task T is periodic with period Pi if a chain of n(i) jobs is released every Ti or more units of time and the 
jobs in the chain executes in turn on processors according to the visit sequence ( Vu, Vi,2,.. .., Vg<i)). 

The end to end periodic task T is called parent task n(i) subtask and the subtask in the same parent task are called 
sibling/(sub)task. The period of an end to end periodic task is the period of its first subtask. In this, the phase <1> of the task 
T means the release time of first job of first sub-task. Similarly, execution time ei,t means the maximum amount of time 
required to complete any job in Ti, t . 

References 
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Model of Multiprocessor and Distributed Systems 



1. A multiprocessor system is tightly coupled so that global status and workload information on all processors can be kept 


current at a low cost. 

2. Here, it is assumed that each processor has its own scheduler. 

3. The processors are of same type or they are identical if the processors can be used interchangeably. 

4. Processors of different types cannot be used interchangeably. 

5. According to the model of heterogeneous processors i.e. unrelated processors, each job can execute on some types of 
processors but not all types. 

6. According to the job shop model, each task Ti is a chain of n(i) jobs denoted by Lkfor k.= I, 2, 3,. . . n(i) for all l<k<n(i). 

7. A flow shop is a special job shop in which all the task have the same visit sequence. 
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9.2 Task Assignment 


Task Assignment 

Most of the hard real-time systems are static in nature. To execute such hard real-time system in a multiprocessor 
environment, the application system is partitioned into modules or sub tasks and each module assigned and bound to 
different individual processors. This process is called task assignment. Generally, the task assignment is done offline but if 
the execution time, response requirement, data and control dependencies and the timing constraints are known then the 
task assignment can be done online. In this case, it is necessary to perform an acceptance test to decide whether to execute 
each new task or not. There are three different types of task assignment methods. They are Simple Bin-Packing 
formulation. Variable Size-Bin-Packing formulation and RMFF Algorithm. The first method ignores both the cost of 
communication and the placement of resources. The second one considers communication cost only and the third one 
considers both communication cost and resource accessed cost. 

Task Assignment based on Execution Time Requirements (Simple Bin-Packing 
Formulation) 

The task assignment method that ignores the communication cost and resource access cost and only considers the 
execution time requirement is called simple bin packing formulation. It is used to determine whether the number and kinds 
of processors planned for the systems are adequate. It is used for online acceptance test and load balancing. 

Consider that the utilization of n periodic task is given and it is asked that the system should be portioned into modules in 
such a way that the tasks in each module are schedulable in a process according to a uniprocessor scheduling algorithm. A 
task assignment can be defined by the subset of task in every module. We can say that the assignment requires ‘m’ 
processors if it partitions the ‘n’ task into ‘m’ schedulable modules. The quality of the task assignment is measured by the 
number of processors required by the assignment. 

In order to schedule ‘n’ independent preemptable periodic task with relative deadlines equal to their periods based on EDF 
algorithm, we need to consider the utilization. In this case, the utilization of all the task is more than one, therefore it is not 
possible schedule feasibly on a single processor. To determine how to partition the task so that individual modules can be 
scheduled on minimum number of processors. We need to know the utilization m for i = 1,2, 3,..., n of all the tasks. We 
know the individual task of module is schedulable if their total utilization is less than or equal to one. However, it is 
constrained that the total utilization of all the tasks in each module should be less than or equal to some value U<1. From 
this, we can formulate a task assignment problem as the simple bin packing problem in which the sizes of all the bins are 
equal to U and the sizes of items to be packed into the bins are ui = 1, 2,..., n. the number of bins required to pack all the 
item is the number of processors required to feasibly schedule all the n tasks. 

Multiprocessor Priority Ceiling Protocol 

The multiprocessor priority ceiling assumes that the tasks and resource have been assigned and statically bound to 
processors and also the scheduler of every synchronization processor knows the priorities and resource requirements of all 
the tasks requiring the global resources managed by the processor. It is also assumed that the resources used by every job 
during nested critical sections lies on the same processor. 

According to this protocol, the scheduler of each processor schedules all the local tasks and global critical sections on the 
processor on a fixed priority basis and controls their resource accesses according to the basic priority ceiling protocol. 
According to the MPCP model, when a task uses a global resource, its global critical section executes on the 
synchronization processor of the resource. If the global section of a remote task were to have a lower priority than some 
local task on the synchronization processor, these local tasks could delay the completion of the global critical section and 
prolong the blocking time of the remote task. For preventing this, the multiprocessor priority ceiling protocol schedules all 
the global critical sections at higher priorities than all the local tasks on every synchronization processor. This can be 
implemented in a system where the lowest priority 7tiowe> t all the tasks is known. The scheduler of each synchronization 
processor schedules the global critical sections of a task with priority Hi at priority 7ti - 
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As an example, consider a system where tasks have priorities 1 through 5,7[i„„, is 5. A global critical section of a task with 
priority 5 is scheduled at priority 0, which is higher that priority 1. Also, the priority of a global critical section of a task 
with priority 1 is - 4 which is the highest priority in the system.(Liu, 2003, pp. 366-368) 

Blocking Time due to Resource Contention 

The figure below shows the illustration that the types of blocking a job may suffer under the multiprocessor priority 
ceiling protocol. 

The system has two processors PI and P2. Jobs J2, J4 and J5 are local to processor PI, which is the synchronization 
processor of the resource dotted. Dotted is a local resource since it is only required by local jobs J2 and J4. Jobs J1 and J3 
are local to processor P2, which is the synchronization processor of the resource black. Black is a global resource and is 
required by Jobs J1, J2 and J3. The jobs are arranged in decreasing order. The short vertical bar on the timeline of each 
jobs marks the release time of the job. 

Here, J2 is directly blocked by J4 when J2 requests dotted at time 4. J1 is directly blocked by J3 when J1 requests black at 
time 3. The global critical section of J2 on P2 is delayed by the global critical section of the higher priority J1 in the time 
interval (7, 11]. The preemption delay thus suffered by J2 must be taken into account when the schedulability of J2 is to be 
determined. This delay is treated as J2’s blocking time. 

At time 11, J1 exits from its critical section. Its priority is lower than the priority of the global critical section of J2. As a 
result, J2 preempts J1 on P2 in the interval (11, 12]. The total delay experienced by ajob as a result of preemption by 
global critical sections of lower priority jobs is also a factor in the total blocking time of the job. 

Finally, ajob can be delayed by a local higher priority job whose execution is suspended on the local processor when the 
priority job executes on a remote processor. This delay is another factor of the blocking time of the job. Here in this 
example, J2 is suspended at time 7. As a result, it is still not completed in (13, 14] and J5 released at time 13.2 cannot start 
until time 14. 
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Task Assignment 


1. To execute the hard real time system in a multiprocessor environment, the application system is partitioned into modules 
or sub tasks and each module assigned and bound to different individual processors. This process is called task 
assignment. 

2. There are three different types of task assignment methods. They are Simple Bin-Packing formulation, Variable Size-Bin- 
Packing formulation and RMFF Algorithm. 

3. The task assignment method that ignores the communication cost and resource access cost and only considers the 
execution time requirement is called simple bin packing formulation. 

4. Simple bin packing is used to determine whether the number and kinds of processors planned for the systems are 
adequate. 

5. According to this protocol, the scheduler of each processor schedules all the local tasks and global critical sections on the 
processor on a fixed priority basis and controls their resource accesses according to the basic priority ceiling protocol. 
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9.3 Elements of Scheduling Algorithms for End to End Periodic Tasks 


Elements of Scheduling Algorithms for End-to-End Periodic Tasks 

End to end scheduling has two essential components: 

1. Protocols for synchronizing the execution of siblings subtasks on different processors so that precedence constraints 
among subtask are maintained. 

2. Algorithms for scheduling subtasks on each processor. 

According to the end to end scheduling approach, the fact that no subtasks in the system ever requires remote resource 
makes it possible for the scheduler on each processor to use any on uniprocessor scheduling algorithms and resource 
access protocols to schedule subtasks on the processor and control their accesses to resources. It is even possible for the 
system to use a mixture of more than one scheduling strategy. For example, using dynamic priority schemes on some 
processors while using fixed priority schemes on the others, or make some processors priority driven and others clock 
driven. 

Most of the real life systems use mixed scheduling strategies on different processors. For example, an end to end task 
whose subtasks compress video images periodically on a computer transmits video stream across a network whereas 
decompresses and displays the video on other computer. The computers at the two ends schedules the compression and 
decompression subtask differently from the way the network schedules the transmission of the video. 

Interprocessor Synchronization Protocols 

The jobs of the first sub-task T u of every end to end periodic task Ti are released no less than pi units of time apart. In a 
multiprocessor system, how jobs in sibling subtasks are released critically affects the schedulability, completion time jitter 
and average response time of end to end to tasks. So, a protocol that governs when the schedulers on different processors 
release the jobs the sibling subtasks is called an execution synchronization protocol. It is simply called a synchronization 
protocol because there is no ambiguity. 

A synchronization protocol is correct if it, 

1. Never releases jobs in any first subtask before the end to end release times of the jobs, and 

2. Never allows the violation of any precedence constraint among sibling subtasks. 

There are two types of synchronization protocols: 

1. Greedy Synchronization Protocol (work-conserving) 

2. Non-Greedy Synchronization Protocol (nonwork-conserving) 


Greedy Synchronization Protocol 

It is a commonly used synchronization protocol, especially in real time system. It can be implemented in many different 
ways. It can be implemented as follows: when the j* job of Tucompletes on Vi, k , the scheduler of Vu sends a 
synchronization signal to the scheduler of V ik+ ion which the successor subtask Ti,k+1 is executed. After receiving the 
synchronization signal, the scheduler of Vu+i releases the corresponding job T m a immediately. Since, the multiprocessor 
system model used here takes into account interprocessor communication costs, the delay in synchronization signal 
delivery id let to be 0. 

The greedy synchronization protocol is a simple protocol. It does not require any global clock synchronization. The only 
global information it requires is the identities of the immediate upstream processor and the next downstream processor in 
the visit sequence of each task which is executed on the processor. The greedy synchronization protocol yields the shortest 
average end to end response time of all tasks compared with non-greedy protocols. But the inter-release intervals of 
consecutive jobs in a later subtask can be shorter than the period of the subtask. This can by illustrated by the following 
example. 
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Consider a system with two processor and three tasks: Ti = (6, 3), T 3 = (6, 10, 4) do not have subtasks, and they are 
executed on Pi and P 2 . T 2 has Subtasks: Tu = (9,3) executes on Pi and T 22 = (9, 3) executes on P 2 . The relative end to end 
deadlines of the tasks are equal to their respective periods. The tasks are given fixed priorities on both the processors. 

Ti has higher priority on Pi and T 2 2 has a higher priority on P 2 . Figure (a) shows the schedule of the tasks when they are 
synchronized on the basis of greedy protocol. The dashed vertical arrows at times 6 and 12 represents synchronization 
signal between processors. Since, the inter-release times of jobs in T 2 , 2 can be as short as 6, T, cannot meet its deadlines. 
Figure (b) shows that if the jobs in T 22 are released periodically. T 3 always meets its deadlines.(Liu, 2003, pp. 374-376) 

End-to-End Tasks in Heterogeneous Systems 

When the processor time demand of every subtask T,, k on a processor meets the design constraint of the scheduling 
algorithm used by the processor, an upper bound Wu to the subtask’s response time is obtained on the processor without 
considering tasks that execute on other processors. Without loss of generality, it is assumed that the scheduler on every 
processor uses a real-time scheduling algorithm. 

In a system of end to end periodic tasks, one of the way to keep the worst case end to end response time of every periodic 
task small and end to end schedulability analysis simple is to use a non-greedy interprocessor synchronization protocol 
that reshapes the release time pattern of every subtask to make the subtask periodic. The modified phase protocol (MPM) 
and release guard (RG) protocol are examples of such protocols. They do not require priority driven scheduling to be used 
on individual processors.(Liu, 2003, pp. 398-399) 
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Corollary 

In a system where a real time algorithm is used to schedule sub-tasks on every processor, an upper bound Wi to the end to 
end response time of any perio dic task Ti synchronized to the MPM protocol or the RG protocol is given by 

n(i) 

Wi = w i.k 

k= 1 


Where n(i) is the number of subtasks in T, and the upper bound Wi* to the response time of every subtask T,, k is obtained by 
considering only subtasks on the same processor as Ti* and by treating every subtask T,., as a periodic task whose period is 
equal to the period Pj of the parent task Tj. 


Predictability and Validation of Dynamic Multiprocessor Systems 


Predictable execution of a set J of jobs is defined in terms of 3 possible schedules of J according to the given scheduling 
algorithm: the maximum, minimal and actual schedules. If every job in J were to execute for as long as its maximum 
execution time (or short as its minimum execution time), the resultant schedule of J would be the maximal (or minimal) 
schedule. These schedules can be easily be constructed as long as the range of execution time of every hard real time job is 
known. The actual execution times of jobs are unknown, and therefore, the actual schedule of J is unknown. 
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Example illustrating scheduling anomalies 


The figure below illustrates that in a dynamic system, the actual start time s(T) of a job ^according to the actual schedule 
of J can be later or earlier than its start time s + (Ji) or s-(Ji) according to the maximal schedule or minimal schedule. The 
start time of T is said to be unpredictable if this condition occurs. Similarly, the actual completion time f(J,) according to 
the actual schedule of J can be later than its completion time f + (J0 of Ji according to the maximum schedule. The 
completion time of J, is unpredictable if this is true. On the other hand, if s (Ji) < s(ji) < s + (J,) then J, is start time predictable 
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and if f (J,) < f(ji) < f + (JO, then J, is completion time predictable. The execution of L is predictable if J, is both start time and 
completion time predictable. The execution behavior of the entire set J is predictable if every job in J is predictable. 

The above example shows that the execution of jobs in dynamic, priority driven multiprocessor is unpredictable in general, 
but this system can have predictable execution under several conditions. According to the maximal and minimal 
schedules, when one of these conditions are satisfied, the response time of each job gives the upper and lower bounds of 
the response time.(Liu, 2003, pp. 400-401) 

Validation of Preemptable/Migratable system 

A dynamic system in which jobs are scheduled on a priority driven manner on ‘m’ processors is called as a 
preemptable/migratable system. Every job that can be dispatched to execute on any processor can be preempted at any 
time and when preempted, they can be resumed on any processor. That is, jobs can be migrated among processors. A 
condition for predictability of a preemptable or migratable system is: the jobs have no precedence constraints and do not 
contend for resources. 

Theorem 

The execution of a system of preemptable/migratable jobs is predictable if all the jobs have fixed release times, are 
independent, and do not contend for resources. 

Other conditions for Predictability 

The system shown in above figure is a preemptable or migratable system. In such system, jobs are scheduled in priority 
driven manner and every job can be dispatched to execute on any processor. However, once a job starts on a processor, it 
is constrained to execute on that processor. Job migration is too costly in most multiprocessor and distributed systems so, 
this scheduling strategy is commonly used. Under the theorem stated below, the execution of preemptable or migratable, 
independent jobs is predictable. 

Theorem 1 

If in a system of preemptable/nonmigratable, independent jobs, preemption can never occur, then the execution of the jobs 
is predictable. 

Theorem 2 

If according to the maximal schedule of a system of preemptable/nonmigratable independent jobs, no job is preemptable 
and the jobs start in the same sequence according to the maximal and minimal schedules, then their execution is 
predictable. 
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Elements of Scheduling Algorithms for End to End Periodic Tasks 


• Note 



1. It is possible for the system to use a mixture of more than one scheduling strategy. 

2. Most of the real life systems use mixed scheduling strategies on different processors. 

3. The jobs of the first sub-task Ty of every end to end periodic task Ti are released no less than pi units of time apart. 

4. A synchronization protocol is correct if it (a) never releases jobs in any first subtask before the end to end release times of 
the jobs, and (b) never allows the violation of any precedence constraint among sibling subtasks. 

5. The greedy synchronization protocol is a simple protocol. It does not require any global clock synchronization. 

6. According to the maximal and minimal schedules, when one of these conditions are satisfied, the response time of each 
job gives the upper and lower bounds of the response time. 

7. A dynamic system in which jobs are scheduled on a priority driven manner on ‘m’ processors is called as a 
preemptable/migratable system. 


84 

For more CSIT notes by ASCOL teachers and students ping: http://csitascolhelp.bloqspot.com/ 




10.1 Model of Real Time Communication 


Things To Remember 


Model of Real-Time Communication 


The figure below shows the well-known model of distributed systems. The hosts are connected by a communication 
network or several interconnected networks. The top layers are simplified and all the entities above all the transport layer 
applications are called. 

Architectural Overview 


The source and destinations of every message are application tasks residing on different hosts. The network interface of 
each host contains an input queue and an output queue which can also be referred to as input/output buffers or simply 
buffers. For the sake of correctness, it is assumed that these queues are jointly maintained by two local servers: the 
transport (TP) handler and Network Access Control (NAC) handler. The TP handler interfaces with local applications and 
provides them with message transport services. The NAC handler interfaces with the network below and provides network 
access and messages transmission services to the TP handler. 


Host i Host k 



Figure shows the data paths indicated by heavy arrows traversed by messages in and out of two hosts. The circles marked 
TPH and NACH are TP and NAC handlers, respectively. When requested to send a message by a local application task, 
the source TP handler places the message in that output queue. From there, each outgoing message is delivered to the 
network under the control of the source NAC handler. After the message has traversed to the network, the destination 
NAC handler places the message in the input queue and notices the destination TP handler. Then, the destination TP 
handler moves the message to the address space of the destination application task and notifies the application of the 
arrival of the message. (Liu, 2003, pp. 235-436) 


85 

For more CSIT notes by ASCOL teachers and students ping: http://csitascolhelp.bloqspot.com/ 





















Packets 


Messages are fragmented into segments before transmission through communication network. Each segment is handled by 
the network as a basic transmission unit called a frame, a packet or a cell. The transmission of the unit is non preemptable. 
Example, a packet is a 53-byte cell in an ATM network. Queuing and propagation delay is also measured in terms of this 
time unit: the length of delay is measured in terms of the number of packets that can be transmitted in that length of time. 
Furthermore, the length of each transmission link is measured by the number of packets that can be transmitted within the 
length of time they take to reverse the link. 

Real Time Traffic Models 

In real time communication, the term ‘real time traffic’ means isochronous of synchronous traffic consisting of message 
streams which are generated by their sources on a continuing basis and delivered to their respective destinations on a 
continuing basis. Such traffic includes periodic and sporadic messages that require some guarantee for on time delivery. 
There are also aperiodic messages. 

Each of these type of message is referred to as a message stream and is denoted by Mi for some index i to distinguish it 
from other messages. A message instance of packet arrives or departs at a point in time when the last bit in it arrives or 
departs. 

Periodic and Aperiodic messages 

The messages that are generated and consumed by periodic tasks are periodic messages. The characteristics of periodic 
messages are similar to the characteristic of their respective source tasks. The transmission of a periodic message is 
periodic task. For example, message streams carrying sensor data and actuator commands generated and consumed by 
digital controllers. 

A periodic message is denoted by Mi by the tuple (pi, &, Di). It means that the inter-arrival times of instances in Mi are 
never less that the period pi of the message, the maximum length of instances in Mi is equal to a packets and each instance 
must be delivered to the destination within Di units of time from its arrival at the source. Di is the relative deadline to Mi. 
This traffic model is called the peak rate model in real time communication. (Liu, 2003) 

There are also aperiodic message streams. The transmission of an aperiodic message is an aperiodic task. An aperiodic 
message stream does not have a relative deadline. However, it is efficient to keep the average delay suffered by an 
aperiodic message instances as small as possible. 

Sporadic Messages 

According to FeVe Model, a sporadic message Mi is characterized by a 5-tuple (pi, pi’, I 1( e,, Di). The parameters p„ e„ and 
Di are the minimum inter-arrival time, maximum length and relative deadline of the instances in Mi respectively. Pi’ is the 
average inter-arrival time of the instances of Mi where the average is taken without waiting for the arrivals of length L In a 
switched network, a switch transmits each packet without waiting for the arrivals of later packets in the same message 
instance. The FeVe model simplifies to Mi = (p, p,’, k, Di). the packets in Mi never arrive less than pi units of time apart, 
and their average inter-arrival time over any time interval of length I, is pi’. The maximum length of each instance of Mi is 
1 and omitted in the tuple. 

Performance Objectives and Constraints 

We want to measure the performance of scheduling, synchronization, and flow control algorithms used for real time 
communication and the performance of the performance of the resultant communication system along two dimension: 
from the points of view of the user and the system. The user is concerned with the on time delivery of periodic and 
sporadic messages and the average response time of aperiodic messages. 

Miss rate: the fraction of all message instances or packets that are delivered to their destinations too late. 

Loss rate: gives the fraction of all message instances in the stream that are dropped en route for flow and congestion 
control reasons. 

Invalid rate: combination or miss and loss rate (sum of miss and loss rate) 

Delay Jitter: the variation in the delays suffered by different message instances or packets in the stream. 

Buffer Requirement: a packet that arrives too early to be processed by the destination must be buffered. So, a larger 
delay jitter of a message stream means that more buffers must be provided by the stream. 

Throughput: the rate of each message stream measures the throughput of the stream. 

Real Time Connections and Service Discipline 

According to the connection oriented approach, a logical simplex connection from the source to the destination is set up 
for the transmission of each message stream. All packets on each connection are sent along a fixed route. 
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Admission control and Connection Establishment 

The use of a fixed route for each connection allows each switch en route to set aside the required bandwidth and buffer 
space for the connection so the network can provide some form of performance guarantee. This also enables the control 
over packet transmission to be done on a per connection basis. 

The client declares the characteristics of the message stream and the required performance of the connection to request a 
connection. The characteristics are defined by parameters of message stream to be carried on the connection. These 
parameters are collectively called flow specification. The required performance is stated in terms of quality of service 
parameters like delay, jitter, and so on. The admission controller of each handler and switch along the chosen route uses 
these parameters as the basis of an acceptance test to determine whether to admit the connection. The connection is 
admitted if the requested quality of service is existing connections service by the handler or switch. 

Packet Switched Networks 

The figure ‘a’ illustrates a packet switched network. Figure ‘b’ shows m*m switch; it has m inputs and m output links, 
both called links 1, 2,.. ., m. The switch routes packets on its input links to its output links. 



Each switching pattern can be represented by a permutation of m tuple (1,2,..., m). for example, for a 4*4 switch, the 4 
tuple (2, 4,1, 3) means that a packet on input link 2 goes to the queue of output link 1, a packet on input link 4 goes to the 
queue of output link 2, and so on. The switch is nonblocking which means that every pattern represents a possible 
switching pattern. (Liu, 2003, p. 440) 

Service Discipline 

The combination of an acceptance test and admission control protocol, and synchronization protocol and a scheduling 
algorithm used for the purpose of rate control, jitter control and scheduling of packets transmission is called a service 
discipline. Rate control and jitter control serve the purpose of flow control for real time. Service discipline are of two 
types: 
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1. Rate allocating 

2. Rate controlled 

Rate allocating discipline allows packets on each connection to be transmitted at higher rates than the guaranteed rate 
provided the switch can still meet the guarantees to all other connections. A service discipline is rate controlled if it 
ensures each connection the guaranteed rate but never allows packets on any connection to be sent above the guaranteed 
rate. A bandwidth preserving server that claims all the background time is allocating. 
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Model of Real Time Communication 

• Note 


1. Each segment of fragmented message is handled by the network as a basic transmission unit called a frame, a packet or a 
cell. 

2. The messages that are generated and consumed by periodic tasks are periodic messages. The transmission of a periodic 
message is periodic task. 

3. The transmission of an aperiodic message is an aperiodic task. 

4. a sporadic message Mi is characterized by a 5-tuple (pi, pi’, I,, ei, Di). 

5. According to the connection oriented approach, a logical simplex connection from the source to the destination is set up 
for the transmission of each message stream. 

6. The combination of an acceptance test and admission control protocol, and synchronization protocol and a scheduling 
algorithm used for the purpose of rate control, jitter control and scheduling of packets transmission is called a service 
discipline. 
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10.2 Priority Based Service Discipline for Switched Networks 


Priority based Service Discipline for switched Networks 

According to the priority based service discipline, the transmission of ready packets are scheduling priority driven manner. 
Among this, the well-known are Delay Earliest Due-Date (Delay-EDD), weighted fair queuing, jittered EDD and Rate 
Controlled Static Priority (RCSF). 

Weighted Fair Queuing Discipline 

It is a packet by packet generalized processor sharing algorithm. It is a type of rate allocating service discipline that 
provides each flow with at least it’s proportional fair share of link capacity & isolates the timing between flow. 

A packet switch may have several inputs feeding to an output link shared by an established flows. Each flow T is 
allocated to all ‘n’ connections is 


Assume that an acceptance test reject the connection that will cause the requested bandwidth to execute the available 
bandwidth. 


Scheduling Packets: 

Assume that the switch is output buffer. The output buffer comprises of two sets of queues. 

1. A set of FIFO queues for the connection 

2. A priority ordered shortest finish number (SEN) queue. 

The finish number fa represents the job completion time and the entry in the queue is represented as fa, i) where i is the 
ID of the connection ‘i’. The entry are sorted in order of the finish numbers. When a packet becomes ready on a FIFO 
queue, its finish number is calculated and the SEN queue is updated. The currently transmitted packet is never preempted, 
if the finish number fn is of new packet is lower than the current one. When a packet completes transmission, it is removed 
from the head of FIFO’s and SFN queues. If the FIFO queue is still backlogged then, the SFN queue is updated with the 
finish number of new ready packet. 


Computing Finish Numbers 

Let the total bandwidth of backlogged flows be U b , the finish number of the link is FN, the current time ‘t’ and the 
previous time instance ti when FN and UbWere updated. 

Rules for comparing the first finish number of a link busy interval 

R1 For as long as the link is idle, FN = 0, U b = 0, ti = 0, and finish number connection i, for every i = 1,2, ... , n is 0. 
R2 When the first packet of length e arrives and starts a busy intervals of the link, 

1. Set ti = t; and 

2. The packet being a connection-i packet, increment U b by u’, complete fni = fhr + e/Ui’, and insert the entry (fn,, i) in the 
SFN queue. 

Rules for comparing subsequent finish numbers during a link busy interval 

R1 for every i, when a connection-i packet arrives at t during a link busy interval, if connection i was idle prior to this 
arrival, 

1. Increment FN by (t - ti)/U b , computer fa = max(FN, Fa) + e/a’, and insert the entry (fa, i) in the SFN queue, and 

2. Set ti = t and decrement U b by a’. 

R2 for every I, when the transmission of a connection-i packet completes, 

1. If the connection remains backlogged, increment fa by e/a’, where e is the length of the new ready connection-i packet, 
and insert the entry (fa, i) in the SFN queue; 

2. If connection i becomes idle, 
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• increment the link number FN by (t - ti)AJ h ; and 

• Set ti = t, and decrement Ub by Ui’. 


Weighted Round Robin Service Discipline 

In a round-robin scheduling, the jobs are placed in a FIFO queue. The job at the head of the queue executes for one time 
slice if it does not complete within the time slice. If it does not complete within a time slice, it is preempted and put at the 
back of the queue. If there are n jobs in the queue, then each job gets on slice every n time slots. 

But in the weighed round robin scheduling, each job gets a weight wf. A job with the weight wtcxecutes for wt, time slices 
each round robin and the length of the round is equal to 


One of the most widely used WRR service discipline is called Greedy WRR. In this discipline during connection 
establishment, the scheduler at each switch assigns to the new connection i a weight of wt. 

It means that connection i is allocated wt slots in each round during which message packets on all existing connections 
sharing the same output link are transmitted in turn. 

Throughput and Delay guarantees 

Consider a constant bit rate periodic message Mi = (pi, ei, Di) where pi is minimum interarrival time of the message, ei is 
size of each message and Di is maximum acceptable end to end delay. 

On each round if more than wt packets are backlogged on the queue i, then wti packets are transmitted and each flow is 
guaranteed wt slots each round. But if the scheduling is rate allocating type, then more that wti packets can be send if there 
is nothing else to transmit. A designed parameter in greedy WRR is the maximum number of slots per round denoted by 
RL called round length. At all times, the sum of weights of all n connections on the output link is no greater than RL, i.e. 


It means each flow is guaranteed a throughput rate wt/RL of the link capacity provide that RL< p™„ (where, p„„„ is the 
minimum pi of all the i’s) and wti > [ei/(pi/RL)j i.e. wti > (e, RL)/pi 

The messages take at most (e,/wt,) * RL. At each subsequent switch round of packets arriving is sent in the next round. It 
means, one round delay is introduced at each hop. Therefore, the end to end delay for connection i with message size ei 
and assigned weight wt passing through r switches is bounded by 
Wi < a/wti + r-l)RL < pi + (r-l)RL 

Connection Establishment 

The weights of all connection on an output link depends on the length and a change of round length may require the 
changes of the weights. It is too costly to change the round length each time a row flow is established because it will 
require the adjustment of weights for all preexisting flows. Therefore, a fixed length round RL is used. 

1. At each hop, the scheduler computes the weight wti required to support the new flow. 

2. If the sum of existing weight is less than RL - wt, then the flow is accepted at that hop. 

In WRR scheduling, the flows get guaranteed capacity and it is efficient to implement but it is unsuitable for scheduling 
precedence constraint jobs and the end to end delay can be bounded. Similarly, since the scheduler is rate allocating, the 
jitter is not controlled but can be bounded, it is suitable for scheduling message transmission through switches and for 
pipelined jobs. It does not require globally synchronized clocks and the sorted queue. It is most suitable constant bit rate 
message. 
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Priority Based Service Discipline for Switched Networks 


3. 


According to the priority based service discipline, the transmission of ready packets are scheduling priority driven manner. 
Weighted Fair Queuing Discipline is a type of rate allocating service discipline that provides each flow with at least it’s 
proportional fair share of link capacity & isolates the timing between flow. 

In WRR scheduling, the flows get guaranteed capacity and it is efficient to implement but it is unsuitable for scheduling 
precedence constraint jobs and the end to end delay can be bounded. 
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10.3 Medium Access Control 


• iMS 

• Things To Remember _ 

Medium Access Control in CAN and IEEE 802.5 token Ring 

The size of a broadcast network is measured in terms of the ratio of the network round trip delay to the transmission time 
of a maximum length packet. If the ratio is small, every station can hear the transmission of every other station 
immediately after the transmission starts. The network is small when the ratio is small. Circulation control information 
among the stations in a small network takes small fraction of packet transmission time. The stations can coordinate 
decisions and actions without incurring significant performance penalty. So, they can carry out a centralized scheduling 
algorithm in a distributed manner. 

Fixed-Priority Scheduling in CAN 

Controlled Area Networks are the examples of small networks. CANs are used to connect component of embedded 
controllers. For example, an automobile control system, whose components control the engine, the brakes, the 
environment and so on. At the transmission rate of one Mbits per second, the end to end length of a CAN must be no 
greater than 50 meters. It means, within a fraction of a bit-time after statin starts to transmit, all the stations on the network 
can hear the transmission. Therefore, the network functionally behaves like a local bus. The output of all the stations are 
wire-ANDed together by the bus: the bit on the network during a bit-time is a logical 0 if the output of any station is a 0 
and a logical 1 only when the outputs of all stations are 1. The MAC protocol for CAN take advantage of this feature. 

(Liu, 2003, p. 468) 

Each message stream transmitted in a network has a unique message ID. Each packet in the stream begins with this ID, 
with the most significant bit first. A station on the network determines whether to receive a packet based on the ID number 
of the packet. Finally, a packet contains 1 to 8 bytes of data. 

CAN MAC protocol is a CSMA/CD (Carrier-Sense Multiple Access/Collision Detection) protocol. A station with a packet 
to send waits until it hears that the network is idle and then commences the transmit the ID number of the packet. At the 
same time the station listens. Whenever it hears a 0 on the network while it is transmitting a 1, it interrupts its own 
transmission. This way, network contention is resolved in favor of the packet with smallest ID among all contending 
packets. 

So, the packets in each message stream are given a fixed priority that is equal to the ID of the message. The smaller the ID 
the higher the priority. Packets are transmitted non-preemptively based on their priorities. 

Prioritized Access in IEEE 802.5 Token Rings 

In an IEEE 802.5 token ring network, packets are transmitted along a circular transmission medium in one direction. By 
breaking the network, a station transmits a packet and placing on the output link to the network. As the packet circulates 
around the network, the station identified by the destination address in the packet header copies the packet. When the 
packet returns to the source station, the station removes the packet from the network. 

Polling 

Network contention is resolved by a polling mechanism called token passing. For polling, each packet has an 8-bit Access 
Control field in its header. One of the bits in AC field is called the token bit. A station can determine whether the network 
is busy or free by examining tokenbit. As a polling packet circulates around the ring, the stations are polled in a round 
robin manner in the order of their physical locations on the ring. The polling packet is called the free token or simply token 
when there is no possibility of confusion. 

Schedulability Analysis 

For scheduling analysis, the following factors can be taken in account.(Liu, 2003, p. 470) 

1. Context Switching: A context switching time is equal to the amount of time required to transmit a free token, plus the 
round-trip delay of the network, which is an upper bound of the time the token takes to reach the station whose outgoing 
packets has the highest priority among all outgoing packets during the transmission of the latest data packet. 

2. Blocking: Since packets are transmitted non-preemptively, we need to take into account the blocking time due to non- 
preemptivity. A higher priority packet that arrives at a station just after the header of the current data packet passed the 
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station may need to wait for a lower priority packet. The blocking delay caused by this priority inversion must also be 
taken into account. Hence, the total blocking time is equal to twice the maximum execution time. 

3. Limited priority levels: Since the network provides only eight priority levels, schedulability loss should also be take into 
account 

Internet and Resource Reservation Protocol 

Resource reservation protocol supports multicast communication among members that may join and leave their multicast 
groups any time. Resource reservation protocol is separate from routing, admission control, connection establishment and 
data transmission. 

RSVP is a receiver initiated protocol designed to accommodate heterogeneous receivers which may desire different 
service qualities. An implicit assumption is that not only the group membership may change any time, but also the service 
quality desired by each individual member may change. Rather that requiring an explicit request to change reservation, 
resource reservations under RSVP must be removed at frequencies specified by the group members. Both path and 
reservation states maintained by each router are soft, meaning that the states are deleted if not renewed within specified 
time intervals. 

Another unique feature of RSVP is reservation style. It allows each receiver to specify which source or sources in the 
multicast group may use the resources reserved on its behalf and to dynamically change this specification if it so desires. 
This specification is provided to the routers in the form of a filter. A filter names the source whose message streams can 
use the resources reserved for the receiver. 

Issues in Resource Reservation 

The resource reservation protocol must deal with the following issues. 

1. Multipoint to multipoint communication 

2. Heterogeneity of destinations 

3. Dynamic multicast group membership 

4. Relation to routing and admission control 

5. Design objectives 

6. 
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Medium Access Control 

• Note 


[ Things To Remember) 


1. The size of a broadcast network is measured in terms of the ratio of the network round trip delay to the transmission time 
of a maximum length packet. 

2. If the ratio is small, every station can hear the transmission of every other station immediately after the transmission starts. 

3. Controlled Area Networks are the examples of small networks that are used to connect component of embedded 
controllers. 

4. In an IEEE 802.5 token ring network, packets are transmitted along a circular transmission medium in one direction. 

5. Network contention is resolved by a polling mechanism called token passing. 

6. RSVP is a receiver initiated protocol designed to accommodate heterogeneous receivers which may desire different 
service qualities. 
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10.4 Real Time Protocols 


Real Time Protocols 

This section gives a brief overview or real time protocol and real time control protocol. 

Data Transport 

The real time protocol is a data transport protocol. Real time protocol is designed to support multicast communication in 
interactive multimedia applications, such as audio and video teleconferencing and distributed simulation. It deals with data 
delivery and uses services provided by lower layer protocols. RTP can run on the unreliable datagram protocol UDP. It 
uses UDP’s multiplexing and error control services and compensates UDP with its own sequencing function. 

RTP is designed to be scalable from a few participants to thousands of participants in multicast communication and to be 
able to accommodate heterogeneity of sources, receivers and networks. RTP assumes that multicast group membership 
may change dynamically. RTP is often integrated with applications. According to RTP, the audio and video are 
transmitted separately. Each medium uses a separate multicast network address and a pair of UDP ports, one for data 
packets and one for control packets. 

A data packet consisting of the fixed RTP header, a possibly empty list of contributing sources and the payload data is 
called a RTP packet. An encapsulation of the RTP packet to be dcHAAAAAA* ned is required by some underlying 
protocols. Typically, one packet of the underlying protocol contains a single RTP packet, but several RTP packets may be 
contained if permitted by the encapsulation method. 

RTCP control Protocol 

The RTP control protocol (RTCP) is based on the periodic transmission of control packets to all participants in the session, 
using the same distribution mechanism as the data packets. The underlying protocol must provide multiplexing of the data 
and control packets, for example using separate port numbers with UDP. RTCP performs three major functions: 
(Schulzrine, 2003) 

1. The primary function is to provide feedback on the quality of the data distribution. This is an integral part of the RTP’s 
role as a transport protocol and is related to the flow and congestion control functions of other transport protocols. Sending 
reception feedback reports to all participants allows one who is observing problems to evaluate whether those problems 
are local or global. This feedback function is performed by the RTCP sender and receiver reports. 

2. RTCP carries a persistent transport-level idcntifi AAAA* er for an RTP source called the canonical name or CNAME. The 
CNAME is required by the receivers to keep track of each participant. The CNAME is also required to associate multiple 
data streams from a given participant in a set of related RTP sessions, for example, to synchronize audio and video. 

3. The fiAAAA* rst two functions require that all participants send RTCP packets, and hence the rate must be controlled so 
that RTP can scale up to a large number of participants. Each participant can independently observe the number of 
participants by sending their control packets to all the others. This number is used to calculate the rate at which the packets 
are sent. 

Communication in Multicomputer Systems 

Like packet switched networks, multihop networks used to interconnect processors I massively parallel machines also 
consist of crossbar switches connected by full duplex links. Unlike packet switched networks, these networks typically 
adopt a simpler routing and flow control scheme called wormhole routing. (Liu, 2003, pp. 499-500) 

Wormhole Networks 

In a wormhole network, messages are segmented into very small flow control units called flits. In a simple wormhole 
network, each switch provides only enough buffer space to hold on flit per input link. The buffer is there in order to 
decouple the input link from the output links. 

At each step, only one flit can occupy a link. While a message is using a link, another message that also need the link must 
wait. So, the transmission of a message may be blocked from starting. When the header i.e. the first flit of a message 
reaches a switch, the switch selects an output link for the message based on the information provided in the header. If the 
output link is free at the time, the header moves forward on that link to the next switch, leaving the input link it used to 
reach the current switch to the second flit in the message. Similarly, the third flit follows, using the link freed by the 


94 

For more CSIT notes by ASCOL teachers and students ping: http://csitascolhelp.bloqspot.com/ 




second flit, and so on. On the other hand, of an output link chosen for a message is in use, the header is buffered and waits 
at the switch until the output link becomes free. In the meantime, subsequent flitthat have reached upstream switches 
occupy flit buffers there, one per switch. The associated input links are not available to other messages. 

This is the routing phase of message transmission. The routing of a message starts when its header leaves the source 
processor and completes when the header reaches the destination processor. Hereafter, the message has all the links along 
the path between its source and destination. Each link is occupied by one of the message’s flits. The flit shifts downstream 
by one link in each step without intervention of the switches. Thus, the message “worms” its way non-preemptively 
through the network without being queued at any switch. Its transmission completes when its flit is delivered to the 
destination processor. 

The message delay through the wormhole network is the sum of its routing time and transmission time. The transmission 
time is the total propagation delay of all links on its transmission path. The time required to route a message depends on 
the overall network traffic and is the nonpredictable component of message delay. (Liu, 2003, p. 500) 
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Real Time Protocols 
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The real time protocol is a data transport protocol. 

Real time protocol is designed to support multicast communication in interactive multimedia applications, such as audio 
and video teleconferencing and distributed simulation. 

A data packet consisting of the fixed RTP header, a possibly empty list of contributing sources and the payload data is 
called a RTP packet. 

The RTP control protocol (RTCP) is based on the periodic transmission of control packets to all participants in the session, 
using the same distribution mechanism as the data packets. 

RTCP provides feedback on the quality of the data distribution. 
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